EMSMOTE: Ensemble multiclass synthetic minority oversampling technique to improve accuracy of multilingual sentiment analysis on imbalance data
DOI: https://doi.org/10.58414/SCIENTIFICTEMPER.2024.15.4.17
Keywords: Sentiment analysis, Natural language processing, Multilingual dataset, Imbalance classification, SMOTE.
Copyright (c) 2024 The Scientific Temper

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Abstract
Natural language processing (NLP) tasks such as multilingual sentiment analysis are inherently challenging, especially when the data are imbalanced. A dataset is considered imbalanced when one class significantly outnumbers the others. In many domains, the minority class holds the crucial information, which presents its own challenges. This research addresses these challenges with an ensemble-based oversampling technique, EMSMOTE (Ensemble Multiclass Synthetic Minority Oversampling Technique). Using SMOTE, EMSMOTE generates multiple synthetic datasets on which various classifiers are trained. Combined with an ensemble random forest classifier, the proposed model attained an accuracy of 90.73%. The ensemble approach mitigates the effect of noisy synthetic samples introduced by SMOTE and significantly improves overall performance on class-imbalanced data.
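The abstract does not spell out the procedure, so the following is only a minimal sketch of the general idea it describes: several SMOTE-balanced copies of the training set, one classifier per copy, and a majority vote. It rests on several assumptions. The function names (`smote_oversample`, `fit_ensemble`, `predict_ensemble`) are hypothetical, the SMOTE step is a simplified standard-library version, a nearest-centroid classifier stands in for the random forest used in the paper, and toy 2-D vectors stand in for text features.

```python
import math
import random
from collections import Counter, defaultdict


def smote_oversample(X, y, k=3, seed=0):
    """Simplified multiclass SMOTE: grow every class to the majority size."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for features, label in zip(X, y):
        by_class[label].append(features)
    target = max(len(rows) for rows in by_class.values())

    X_out, y_out = list(X), list(y)
    for label, rows in by_class.items():
        for _ in range(target - len(rows)):
            base = rng.choice(rows)
            # k nearest same-class neighbours of the base point
            neighbours = sorted(
                (r for r in rows if r is not base),
                key=lambda r: math.dist(base, r),
            )[:k]
            if neighbours:
                other = rng.choice(neighbours)
                gap = rng.random()
                # synthetic point on the segment between base and neighbour
                new = tuple(b + gap * (o - b) for b, o in zip(base, other))
            else:
                new = tuple(base)  # single-sample class: duplicate it
            X_out.append(new)
            y_out.append(label)
    return X_out, y_out


def fit_centroids(X, y):
    """Stand-in base learner (assumption): one mean vector per class."""
    counts = Counter(y)
    sums = {}
    for features, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            acc[i] += value
    return {lab: tuple(v / counts[lab] for v in acc) for lab, acc in sums.items()}


def predict_centroid(centroids, x):
    return min(centroids, key=lambda lab: math.dist(centroids[lab], x))


def fit_ensemble(X, y, n_members=5, k=3):
    """Train one member per differently seeded SMOTE-balanced dataset."""
    members = []
    for seed in range(n_members):
        X_bal, y_bal = smote_oversample(X, y, k=k, seed=seed)
        members.append(fit_centroids(X_bal, y_bal))
    return members


def predict_ensemble(members, x):
    """Majority vote over the members' predictions."""
    votes = Counter(predict_centroid(m, x) for m in members)
    return votes.most_common(1)[0][0]


# Toy imbalanced data: 6 positive, 2 negative, 3 neutral samples.
X = [(0.0, 0.0), (0.2, 0.1), (0.1, 0.3), (0.3, 0.2), (0.2, 0.4), (0.4, 0.1),
     (3.0, 3.0), (3.2, 2.9),
     (0.0, 3.0), (0.1, 3.2), (0.3, 2.9)]
y = ["positive"] * 6 + ["negative"] * 2 + ["neutral"] * 3

members = fit_ensemble(X, y, n_members=5)
print(predict_ensemble(members, (3.1, 3.0)))
```

Because each member sees a different random set of synthetic points, an unlucky synthetic sample in one dataset influences only one vote, which is one plausible reading of how the ensemble dampens SMOTE noise. With real text data, the toy vectors would be replaced by features such as TF-IDF or embeddings, and the stand-in learner by a random forest.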