Multi-objective nature inspired hybrid optimization algorithm to improve prediction accuracy on imbalance medical datasets

Published

17-08-2024

DOI:

https://doi.org/10.58414/SCIENTIFICTEMPER.2024.15.3.33

Keywords:

Class imbalance, Machine learning, Ensemble techniques, Sampling methods, Feature Selection

Dimensions Badge

Issue

Section

SECTION C: ARTIFICIAL INTELLIGENCE, ENGINEERING, TECHNOLOGY

Authors

  • Nithya R Department of Computer Science, St.Joseph’s College (Autonomous), Affiliated to Bharathidasan University, Tiruchirappalli,Tamilnadu,India.
  • Kokilavani T Department of Computer Science, Christ (Deemed to be) University, Nagasandra, Yeshwanthpur Campus, Bangalore, Karnataka, India
  • Joseph Charles P Department of Computer Science, St. Joseph’s College (Autonomous), Affiliated to Bharathidasan University, Tiruchirappalli, Tamil Nadu, India.

Abstract

Imbalanced medical datasets pose a significant challenge for predictive modelling. The current study presents a new method of performing feature selection specifically for the imbalanced medical datasets to improve accuracy of the predictions. The proposed Multi-Objective Feature Selection with Cost-Sensitive (MOFSCS)algorithm leverages the large-scale exploration capability of the Squirrel Search to generate diverse candidate feature subsets and employs Tabu Search for local optima refinement. One of the key developments is learning with consideration of costs, which is closer to the identification of the minority class. The effectiveness of the proposed approach is ensured by the experiments on different imbalanced medical datasets, namely, heart disease and stroke prediction datasets. The results reveal that the proposed method, when integrated with the XGBoost classifier, achieves a precision of 98.5%, recall of 98.7%, F1-score of 98.6%, accuracy of 98.7%, and an AUC-ROC of 98.7% on the heart disease dataset. Similarly, for brain stroke prediction, the model attains a precision of 98.9%, recall of 99.0%, F1-score of 98.9%, accuracy of 99.0%, and an AUC-ROC of 99.0%.

How to Cite

R, N., T, K., & P, J. C. (2024). Multi-objective nature inspired hybrid optimization algorithm to improve prediction accuracy on imbalance medical datasets. The Scientific Temper, 15(03), 2651–2662. https://doi.org/10.58414/SCIENTIFICTEMPER.2024.15.3.33

Downloads

Download data is not yet available.