TY - JOUR AU - Bheemarao, Rashmi Kariganuru AU - Guruprasad, Hassan Sadashiva AU - Ravi, Shambhavi Bangalore PY - 2025 TI - KSWN (Kannada SentiWordNet): Developing a Sentiment Lexicon for Kannada using Translation and Word Embedding Techniques JF - Journal of Computer Science VL - 21 IS - 6 DO - 10.3844/jcssp.2025.1482.1489 UR - https://thescipub.com/abstract/jcssp.2025.1482.1489 AB - Opinion Mining has gained significant attention in recent years, especially due to the enormous growth of online content generation. However, finding the opinions expressed in comments and reviews is highly challenging in Indian regional languages due to the lack of annotated datasets. Opinion mining has predominantly been conducted in English, with recent efforts extending to Hindi and other languages. A primary resource in opinion mining is SentiWordNet, which aids in analyzing opinions by providing sentiment scores for words. Building a KSWN has been done to explore regional languages, as there is a notable absence of a comparable resource for Kannada. Thus, this study proposes creating a Kannada sentiment lexicon using a translation-based approach from various English sentiment lexicons. KSWN, a sentiment lexicon for Kannada developed using a translation approach, achieved an inter-annotator agreement, with Cohen's Kappa scores of 0.84 for positive words and 0.79for negative words as verified by two Kannada annotators. The Kannada SentiWordNet, initially created, may not cover all sentiment-bearing words, word embeddings are employed to capture semantic similarity. As a seed lexicon can be the foundation for tagging a new corpus. Words in the new corpus are annotated by matching them with the seed list. New words with similar sentiment profiles are identified by applying similarity measures to the embedded word representations. These newly identified words are then added to the lexicon, further enriching it for sentiment analysis tasks.