Enhancing classification accuracy on code-mixed and imbalanced data using an adaptive deep autoencoder and XGBoost
Downloads
Published
DOI:
https://doi.org/10.58414/SCIENTIFICTEMPER.2024.15.3.27Keywords:
Sentiment analysis, Deep learning, Code-mixing, Autoencoder, Imbalance classification.Dimensions Badge
Issue
Section
License
Copyright (c) 2024 The Scientific Temper

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
This study introduces a pioneering approach for enhancing classification accuracy on code-mixed and imbalanced data by integrating an adaptive deep autoencoder with dynamic sampling techniques. Targeting the intricate challenges of sentiment analysis within such datasets, this methodology employs an enhanced XGBoost classifier, optimized to leverage the nuanced features extracted by the autoencoder. The experimental evaluation across diverse datasets, predominantly involving Tamil-English code-mixed texts, demonstrates a notable improvement in performance metrics: accuracy reached 84.2%, precision was recorded at 74.8%, recall stood at 78.4%, and the F1-Score achieved 76.6%. This marks an enhancement over existing methods by 0.5% to 1.5%, substantiating the model's robust capability in effectively handling linguistic diversity and class imbalances. The novelty of this research lies in the seamless integration of dynamic sampling within the autoencoder's training loop, significantly boosting the adaptability and effectiveness of the machine-learning model in real-world applications.Abstract
How to Cite
Downloads
Similar Articles
- Isreal zewide, Abde S. Hajigame, Wondwosen Wondimu, Kibinesh Adimasu, Response of Bread Wheat (Triticum aestivum L.) Varieties to Blended NPSB Fertilizer Levels in Sori Saylem District, South-West Ethiopia , The Scientific Temper: Vol. 14 No. 02 (2023): The Scientific Temper
- Jayendra K. Singh, Gyan P. Singh, Sanjay K. Singh, Son preference and children sex composition in Uttar Pradesh: An empirical analysis , The Scientific Temper: Vol. 14 No. 03 (2023): The Scientific Temper
- Kavitha V, Panneer Arokiaraj S., RPL-eSOA: Enhancing IoT network sustainability with RPL and enhanced sandpiper optimization algorithm , The Scientific Temper: Vol. 15 No. 03 (2024): The Scientific Temper
- Vinay Viratia, Sandeep Kumar, Shama Praveen, Tarang Shrivastava, Priyanka, Enhancing Trunk Control Balance in Children with Spastic Diplegic Cerebral Palsy: Comparative Effectiveness of the Vestibular Stimulation Technique and Standard Treatment , The Scientific Temper: Vol. 13 No. 02 (2022): The Scientific Temper
- Tarannum ., Anuja Pandey, Arti Rauthan, An evaluation of the impact of lean management practices on patients’ satisfaction at a small healthcare facility , The Scientific Temper: Vol. 14 No. 03 (2023): The Scientific Temper
- Bhuvaneshwarri Ilango, A machine translation model for abstractive text summarization based on natural language processing , The Scientific Temper: Vol. 14 No. 03 (2023): The Scientific Temper
You may also start an advanced similarity search for this article.
Most read articles by the same author(s)
- Ayesha Shakith, L. Arockiam, EMSMOTE: Ensemble multiclass synthetic minority oversampling technique to improve accuracy of multilingual sentiment analysis on imbalance data , The Scientific Temper: Vol. 15 No. 04 (2024): The Scientific Temper
- Sindhu S, L. Arockiam, DRMF: Optimizing machine learning accuracy in IoT crop recommendation with domain rules and MissForest imputation , The Scientific Temper: Vol. 15 No. 03 (2024): The Scientific Temper
- S. Sindhu, L. Arockiam, A lightweight selective stacking framework for IoT crop recommendation , The Scientific Temper: Vol. 15 No. 04 (2024): The Scientific Temper