EMSMOTE: Ensemble multiclass synthetic minority oversampling technique to improve accuracy of multilingual sentiment analysis on imbalance data
DOI:
https://doi.org/10.58414/SCIENTIFICTEMPER.2024.15.4.17
Keywords:
Sentiment analysis, Natural language processing, Multilingual dataset, Imbalance classification, SMOTE.
License
Copyright (c) 2024 The Scientific Temper

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Abstract
Natural language processing (NLP) tasks such as multilingual sentiment analysis are inherently challenging, especially when the data are imbalanced. A dataset is considered imbalanced when one class significantly outnumbers the others. In many domains the minority class holds crucial information, so the imbalance poses particular challenges. This research addresses these challenges with an ensemble-based oversampling technique, EMSMOTE (Ensemble Multiclass Synthetic Minority Oversampling Technique). EMSMOTE uses SMOTE to generate multiple synthetic datasets, which are then used to train various classifiers. Combined with an ensemble random forest classifier, the proposed model attained an accuracy of 90.73%. The ensemble approach mitigates the effect of noisy synthetic samples introduced by SMOTE and significantly improves overall performance on class-imbalanced data.
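The idea summarized in the abstract (oversample the minority classes several times with SMOTE, train one classifier per synthetic dataset, and combine the predictions) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the algorithm's details, so every function name below is hypothetical, the 2-D points are toy stand-ins for text features, and a simple nearest-centroid classifier is swapped in for the random forest to keep the example dependency-free.

```python
import math
import random
from collections import Counter


def distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((u - v) ** 2 for u, v in zip(a, b)))


def smote_multiclass(X, y, k=3, rng=None):
    """SMOTE-style oversampling (assumed variant): grow every class to the
    majority class size by interpolating between a sample and one of its
    k nearest neighbours from the same class."""
    rng = rng or random.Random(0)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, n in counts.items():
        members = [x for x, lab in zip(X, y) if lab == label]
        if n == target or len(members) < 2:
            continue  # already at target size, or nothing to interpolate with
        for _ in range(target - n):
            base = rng.choice(members)
            neighbours = sorted(
                (m for m in members if m is not base),
                key=lambda m: distance(base, m),
            )[:k]
            other = rng.choice(neighbours)
            gap = rng.random()  # position along the segment base -> other
            X_out.append([b + gap * (o - b) for b, o in zip(base, other)])
            y_out.append(label)
    return X_out, y_out


def fit_centroids(X, y):
    """Stand-in base classifier (NOT a random forest): one mean vector per class."""
    counts, sums = Counter(y), {}
    for x, lab in zip(X, y):
        acc = sums.setdefault(lab, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
    return {lab: [v / counts[lab] for v in acc] for lab, acc in sums.items()}


def predict_centroid(model, x):
    """Return the label whose centroid is closest to x."""
    return min(model, key=lambda lab: distance(model[lab], x))


def fit_ensemble(X, y, n_members=5, k=3):
    """Train one base classifier per independently oversampled dataset."""
    models = []
    for seed in range(n_members):
        Xs, ys = smote_multiclass(X, y, k=k, rng=random.Random(seed))
        models.append(fit_centroids(Xs, ys))
    return models


def predict_ensemble(models, x):
    """Majority vote across the ensemble members."""
    votes = Counter(predict_centroid(m, x) for m in models)
    return votes.most_common(1)[0][0]


# Toy imbalanced three-class data (made up for illustration only).
data_rng = random.Random(42)


def blob(center, n):
    return [[c + data_rng.gauss(0, 0.3) for c in center] for _ in range(n)]


X = blob([0, 0], 30) + blob([3, 3], 8) + blob([0, 4], 4)
y = ["positive"] * 30 + ["neutral"] * 8 + ["negative"] * 4

X_bal, y_bal = smote_multiclass(X, y, k=3, rng=random.Random(0))
balanced_counts = dict(Counter(y_bal))

models = fit_ensemble(X, y, n_members=5)
prediction = predict_ensemble(models, [0.1, 3.9])
```

Because each ensemble member sees a differently seeded set of synthetic samples, an unlucky interpolation in one dataset affects only one vote, which is one plausible reading of how an ensemble can dampen noisy SMOTE samples. In practice one would likely use a library SMOTE implementation and a random forest in place of these stand-ins.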

