Clean Balance-Ensemble CHD: A Balanced Ensemble Learning Framework for Accurate Coronary Heart Disease Prediction
DOI:
https://doi.org/10.58414/SCIENTIFICTEMPER.2025.16.10.05
Keywords:
Coronary Heart Disease (CHD) Prediction, Balanced Ensemble Learning, Preprocessing, Noise Reduction, Prediction Accuracy
License
Copyright (c) 2025 The Scientific Temper

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Abstract
Coronary Heart Disease (CHD) remains one of the leading causes of death worldwide, which makes early and reliable prediction essential for timely medical intervention. Traditional machine learning approaches frequently struggle with noisy and imbalanced datasets, leading to biased predictions and reduced diagnostic reliability. To address these limitations, this paper proposes the CleanBalance-EnsembleCHD algorithm, which combines data cleaning, class balancing, and ensemble learning to improve prediction accuracy. The goal is to reduce noise, handle imbalance, and combine the strengths of multiple classifiers to detect CHD more effectively. For noise reduction, the methodology employs Edited Nearest Neighbor (ENN) and Iterative Partitioning Filter (IPF); if imbalance persists, the Synthetic Minority Oversampling Technique (SMOTE) is applied. Five classifiers, namely Rotation Forest, LogitBoost, Multilayer Perceptron, Logistic Model Trees (LMT), and Random Forest, were trained, and the best-performing models were selected for integration into a weighted soft-voting ensemble. The experimental evaluation on a CHD dataset with an initial class imbalance (majority/minority ratio: 1.038, Gini index: 0.4998) showed significant improvements. After ENN and IPF cleaning, the dataset was reduced from 1011 to 853 balanced instances (class counts: 414 for class 1 and 439 for class 0). Individual classifiers performed well, with accuracies of 97.36% (Rotation Forest), 94.72% (LogitBoost), 96.04% (Multilayer Perceptron), 97.95% (LMT), and 98.53% (Random Forest). The top three models, Random Forest, LMT, and Rotation Forest, were then combined into an ensemble that outperformed all individual models on the test set, with an accuracy of 99.42%, an F1-score of 0.9939, and a Matthews correlation coefficient (MCC) of 0.9884. These findings show that CleanBalance-EnsembleCHD provides superior predictive reliability and supports noise-resistant, balanced decision-making.
Finally, the proposed framework provides a powerful and interpretable solution for early CHD detection, with the potential to help clinicians with risk assessment and medical decision support.
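
The abstract relies on two small calculations: the class-balance statistics (majority/minority ratio and Gini index) and weighted soft voting. The following is a minimal sketch of both, not the authors' implementation. The function names `balance_stats` and `weighted_soft_vote` are hypothetical, the per-model probabilities are invented for illustration, and using each model's reported accuracy as its voting weight is an assumption, since the abstract does not state how the weights were derived.

```python
from collections import Counter


def balance_stats(labels):
    """Return (majority/minority ratio, Gini index) for a list of class labels."""
    counts = Counter(labels)
    total = sum(counts.values())
    ratio = max(counts.values()) / min(counts.values())
    gini = 1.0 - sum((c / total) ** 2 for c in counts.values())
    return ratio, gini


def weighted_soft_vote(probas, weights):
    """Combine per-model class-probability vectors with a weighted average.

    probas  -- one probability vector per model, e.g. [[P(0), P(1)], ...]
    weights -- one non-negative weight per model
    Returns (predicted class index, averaged probability vector).
    """
    if len(probas) != len(weights):
        raise ValueError("need exactly one weight per model")
    total_w = sum(weights)
    n_classes = len(probas[0])
    avg = [
        sum(w * p[k] for p, w in zip(probas, weights)) / total_w
        for k in range(n_classes)
    ]
    return max(range(n_classes), key=avg.__getitem__), avg


# Class counts reported after ENN and IPF cleaning: 414 (class 1), 439 (class 0).
ratio, gini = balance_stats([1] * 414 + [0] * 439)
print(f"maj/min ratio = {ratio:.4f}, Gini index = {gini:.4f}")

# Invented probabilities from three models for a single patient.
probas = [[0.30, 0.70], [0.55, 0.45], [0.40, 0.60]]
# Assumption: weights set to the reported accuracies of Random Forest,
# LMT, and Rotation Forest; the abstract does not specify the weighting.
weights = [0.9853, 0.9795, 0.9736]
label, avg = weighted_soft_vote(probas, weights)
print("predicted class:", label, "averaged probabilities:", [round(p, 4) for p in avg])
```

With two classes, a Gini index close to 0.5 indicates nearly equal class sizes, which is consistent with the counts reported after cleaning. Soft voting averages probabilities rather than counting hard votes, so a single confident model can outweigh two uncertain ones.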

