Sentiment Analysis Using Light Weight - Gradient Boosting Machine based Feature Selection
- 1 Department of Information Technology, Malla Reddy University, Hyderabad, India
- 2 Department of Data Science, Malla Reddy University, Hyderabad, India
- 3 Department of Electronics and Communication Engineering, Malla Reddy University, Hyderabad, India Srikanth Kadainti Department of Data Science, Malla Reddy University, Hyderabad, India
Abstract
Sentiment analysis is a significant task in Natural Language Processing (NLP) that differentiates the emotions and opinions expressed in text or reviews. The sentiment analysis is challenging due to the complex language patterns and inappropriate or redundant features used for classification. In this research, the Light Weight - Gradient Boosting Machine (LWGBM) based feature selection is proposed to choose relevant features for classification to eliminate inappropriate or redundant features and learn the complex language patterns. Then, the classification is performed by using H2O Automatic Machine Learning (H2O Auto ML) algorithm which classifies the sentiments as positive, neutral and negative with high accuracy. The performance of the proposed method is analyzed with different metrics: accuracy, precision, recall and f1-score. The proposed LWGBM and H2O ML method attains an accuracy of 95.39% on the Internet Movie Data Base (IMDB) dataset, and 92.41% accuracy on SemEval - 2016 dataset, which is more effective than the conventional methods namely, Extra-Long Neural Network (XLNet) and Arabic Bidirectional Encoder Representation Transformer (AraBERT).
DOI: https://doi.org/10.3844/jcssp.2025.1049.1058
Copyright: © 2025 Bikku Ramavath, Srikanth Kadainti and Nemani Subash. This is an open access article distributed under the terms of the
Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 208 Views
- 57 Downloads
- 0 Citations
Download
Keywords
- H2O Automatic Machine Learning
- Light Weight - GradientBoosting Machine
- Natural Language Processing
- Sentiment Analysis