EMSMOTE: Ensemble multiclass synthetic minority oversampling technique to improve accuracy of multilingual sentiment analysis on imbalance data
Downloads
Published
DOI:
https://doi.org/10.58414/SCIENTIFICTEMPER.2024.15.4.17Keywords:
Sentiment analysis, Natural language processing, Multilingual dataset, Imbalance classification, SMOTE.Dimensions Badge
Issue
Section
License
Copyright (c) 2024 The Scientific Temper

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Natural language processing (NLP) tasks, such as multilingual sentiment analysis, are inherently challenging, especially when dealing with unbalanced data. A dataset is considered imbalanced when one class significantly dominates the others, creating an unbalanced distribution. In many domains, the minority class holds crucial information, presenting unique challenges. This research addresses these challenges using an ensemble-based oversampling technique, EMSMOTE (Ensemble Multiclass Synthetic Minority Oversampling Technique). By leveraging SMOTE, EMSMOTE generates multiple synthetic datasets to train various classifiers. The proposed model, when combined with an ensemble random forest classifier, attained an impressive accuracy of 90.73%. This ensemble approach not only mitigates the effects of noisy synthetic samples introduced by SMOTE but also showcases significant enhancement in the overall performance in tackling class imbalances.Abstract
How to Cite
Downloads
Similar Articles
- Mohamed Azharudheen A, Vijayalakshmi V, Improvement of data analysis and protection using novel privacy-preserving methods for big data application , The Scientific Temper: Vol. 15 No. 02 (2024): The Scientific Temper
- S. Vanaja, Hari Ganesh S, Application of data mining and machine learning approaches in the prediction of heart disease – A literature survey , The Scientific Temper: Vol. 15 No. spl-1 (2024): The Scientific Temper
- V. Parimala, D. Ganeshkumar, Solar energy-driven water distillation with nanoparticle integration for enhanced efficiency, sustainability, and potable water production in arid regions , The Scientific Temper: Vol. 15 No. 01 (2024): The Scientific Temper
- Rashmi Chandra, Afroz Alam, Phytochemical Analysis Using X-ray Diffraction Spectroscopy (XRD) and GC-MS Analysis of Bioactive Compounds in Cucumis sativus L. (Angiosperms; Cucurbitaceae) , The Scientific Temper: Vol. 13 No. 01 (2022): The Scientific Temper
- Karan Berry, Shiv Kumar, Exploring the mediating role of gastronomic experience in tourist satisfaction: A multigroup analysis , The Scientific Temper: Vol. 15 No. 03 (2024): The Scientific Temper
- Syam Sundar. S, Direct reuse of scour and bleach effluent water for cotton knitted fabrics , The Scientific Temper: Vol. 14 No. 02 (2023): The Scientific Temper
- Nitika, Kuldeep Chaudhary, A critical review of social media advertising literature: Visualization and bibliometric approach , The Scientific Temper: Vol. 15 No. 03 (2024): The Scientific Temper
- V. Yamuna , P. Kandhavadivu, Recent developments in the synthesis of superabsorbent polymer from natural food sources: A review , The Scientific Temper: Vol. 14 No. 02 (2023): The Scientific Temper
- Sawitri Devi, Raj Kumar, Unveiling scholarly insights: A bibliometric analysis of literature on gender bias at the workplace , The Scientific Temper: Vol. 15 No. 03 (2024): The Scientific Temper
- Suresh L. Chitragar, Measurement of agricultural productivity and levels of development in the Malaprabha river basin, Karnataka, India , The Scientific Temper: Vol. 15 No. 01 (2024): The Scientific Temper
<< < 1 2 3 4 5 6 7 8 9 10 > >>
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- Ayesha Shakith, L. Arockiam, Enhancing classification accuracy on code-mixed and imbalanced data using an adaptive deep autoencoder and XGBoost , The Scientific Temper: Vol. 15 No. 03 (2024): The Scientific Temper
- Sindhu S, L. Arockiam, DRMF: Optimizing machine learning accuracy in IoT crop recommendation with domain rules and MissForest imputation , The Scientific Temper: Vol. 15 No. 03 (2024): The Scientific Temper
- S. Sindhu, L. Arockiam, A lightweight selective stacking framework for IoT crop recommendation , The Scientific Temper: Vol. 15 No. 04 (2024): The Scientific Temper