EMSMOTE: Ensemble multiclass synthetic minority oversampling technique to improve accuracy of multilingual sentiment analysis on imbalance data
Downloads
Published
DOI:
https://doi.org/10.58414/SCIENTIFICTEMPER.2024.15.4.17Keywords:
Sentiment analysis, Natural language processing, Multilingual dataset, Imbalance classification, SMOTE.Dimensions Badge
Issue
Section
License
Copyright (c) 2024 The Scientific Temper

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Natural language processing (NLP) tasks, such as multilingual sentiment analysis, are inherently challenging, especially when dealing with unbalanced data. A dataset is considered imbalanced when one class significantly dominates the others, creating an unbalanced distribution. In many domains, the minority class holds crucial information, presenting unique challenges. This research addresses these challenges using an ensemble-based oversampling technique, EMSMOTE (Ensemble Multiclass Synthetic Minority Oversampling Technique). By leveraging SMOTE, EMSMOTE generates multiple synthetic datasets to train various classifiers. The proposed model, when combined with an ensemble random forest classifier, attained an impressive accuracy of 90.73%. This ensemble approach not only mitigates the effects of noisy synthetic samples introduced by SMOTE but also showcases significant enhancement in the overall performance in tackling class imbalances.Abstract
How to Cite
Downloads
Similar Articles
- M. A. Shanti, Optimizing predictive accuracy: A comparative study of feature selection strategies in the healthcare domain , The Scientific Temper: Vol. 15 No. spl-1 (2024): The Scientific Temper
- P S Renjeni, B Senthilkumaran, Ramalingam Sugumar, L. Jaya Singh Dhas, Gaussian kernelized transformer learning model for brain tumor risk factor identification and disease diagnosis , The Scientific Temper: Vol. 16 No. 02 (2025): The Scientific Temper
- Shaik Abdulla P., Abdul Razak T., Retrieval-Based Inception V3-Net Algorithm and Invariant Data Classification using Enhanced Deep Belief Networks for Content-Based Image Retrieval , The Scientific Temper: Vol. 15 No. spl-1 (2024): The Scientific Temper
- Naveena Somasundaram, Vigneshkumar M, Sanjay R. Pawar, M. Amutha, Balu S, Priya V, AI-driven material design for tissue engineering a comprehensive approach integrating generative adversarial networks and high-throughput experimentation , The Scientific Temper: Vol. 15 No. 01 (2024): The Scientific Temper
- V. Manikandabalaji, R. Sivakumar, V. Maniraj, A framework for diabetes diagnosis based on type-2 fuzzy semantic ontology approach , The Scientific Temper: Vol. 15 No. 03 (2024): The Scientific Temper
- Nithya R, Kokilavani T, Joseph Charles P, Multi-objective nature inspired hybrid optimization algorithm to improve prediction accuracy on imbalance medical datasets , The Scientific Temper: Vol. 15 No. 03 (2024): The Scientific Temper
- J. Fathima Fouzia, M. Mohamed Surputheen, M. Rajakumar, Hybrid pigeon optimization-based feature selection and modified multi-class semantic segmentation for skin cancer detection (HPO-MMSS) , The Scientific Temper: Vol. 16 No. 05 (2025): The Scientific Temper
- REKHA KHANDAL, SHILPENDRA KOUR, RASHMI TRIPATHI, ANTIBACTERIAL ACTIVITY OF PHYTO-CHEMICALS OBTAINED FROM LEAFEXTRACTS OF SOME MEDICINAL PLANTS ON PATHOGENS OF SEMI-ARID SOIL , The Scientific Temper: Vol. 3 No. 1&2 (2012): The Scientific Temper
- Divya R., Vanathi P. T., Harikumar R., An optimized cardiac risk levels classifier based on GMM with min- max model from photoplethysmography signals , The Scientific Temper: Vol. 15 No. 03 (2024): The Scientific Temper
- Deepa Ramachandran VR VR, Kamalraj N, Hybrid deep segmentation architecture using dual attention U-Net and Mask-RCNN for accurate detection of pests, diseases, and weeds in crops , The Scientific Temper: Vol. 16 No. 07 (2025): The Scientific Temper
<< < 5 6 7 8 9 10 11 12 13 14 > >>
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- Sindhu S, L. Arockiam, DRMF: Optimizing machine learning accuracy in IoT crop recommendation with domain rules and MissForest imputation , The Scientific Temper: Vol. 15 No. 03 (2024): The Scientific Temper
- Ayesha Shakith, L. Arockiam, Enhancing classification accuracy on code-mixed and imbalanced data using an adaptive deep autoencoder and XGBoost , The Scientific Temper: Vol. 15 No. 03 (2024): The Scientific Temper
- S. Sindhu, L. Arockiam, A lightweight selective stacking framework for IoT crop recommendation , The Scientific Temper: Vol. 15 No. 04 (2024): The Scientific Temper

