Enhancing data imputation in complex datasets using Lagrange polynomial interpolation and hot-deck fusion
DOI: https://doi.org/10.58414/SCIENTIFICTEMPER.2025.16.2.05
Keywords: Data Imputation, Hot-Deck Fusion, Hybrid Methods, Lagrange Polynomial Interpolation, Machine Learning
License
Copyright (c) 2025 The Scientific Temper

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Abstract
Data imputation is vital to preserving dataset quality in machine learning, where missing data reduces model accuracy. This research proposes a new imputation method, Lagrange Polynomial Interpolation with Hot-Deck Fusion (LPIHD), to improve the quality and reliability of imputed datasets, particularly when the data are multifaceted and comprise multiple types. LPIHD combines two techniques: Lagrange polynomial interpolation estimates missing values from known data points, and hot-deck fusion refines these estimates by borrowing similar values from a donor population. Applied to two distinct datasets, one on wine quality and one on heart disease, this hybrid approach achieved lower MAE and RMSE values than those previously recorded. LPIHD also achieved better accuracy on both datasets at varying rates of missing data, and the reductions in MAE and RMSE across both datasets affirm the method's efficacy. These findings suggest that LPIHD can produce more accurate data imputations, making it a useful technique for fields that need a strong analytical platform.
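The two stages named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' LPIHD implementation: the abstract does not say how the stages are fused, so the code assumes a simple weighted average of the interpolated estimate and the donor value. The function names, the use of the row index as the interpolation variable, the `k` nearest observed points, and the `weight` parameter are all hypothetical choices made for illustration.

```python
import math


def lagrange_estimate(xs, ys, x):
    """Evaluate the Lagrange polynomial through the points (xs, ys) at x."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        term = yi
        for j, xj in enumerate(xs):
            if j != i:
                term *= (x - xj) / (xi - xj)
        total += term
    return total


def hot_deck_donor(rows, target_idx, col):
    """Return rows[k][col] from the donor row closest to the target row.

    Closeness is squared Euclidean distance over the other columns that
    are observed in both rows (an assumed similarity measure).
    """
    target = rows[target_idx]
    best_value, best_dist = None, math.inf
    for k, row in enumerate(rows):
        if k == target_idx or row[col] is None:
            continue
        dist = sum(
            (row[c] - target[c]) ** 2
            for c in range(len(row))
            if c != col and row[c] is not None and target[c] is not None
        )
        if dist < best_dist:
            best_value, best_dist = row[col], dist
    return best_value


def impute_column(rows, col, k=3, weight=0.5):
    """Fill missing values (None) in one column; returns a new list of rows.

    Assumption: the fused value is weight * interpolation + (1 - weight) * donor.
    """
    observed = [(i, r[col]) for i, r in enumerate(rows) if r[col] is not None]
    result = [list(r) for r in rows]
    for i, row in enumerate(rows):
        if row[col] is not None:
            continue
        # Interpolate over the k observed points nearest in row index.
        nearest = sorted(observed, key=lambda p: abs(p[0] - i))[:k]
        interp = lagrange_estimate([p[0] for p in nearest],
                                   [p[1] for p in nearest], i)
        donor = hot_deck_donor(rows, i, col)
        if donor is None:  # no donor available: fall back to interpolation
            result[i][col] = interp
        else:
            result[i][col] = weight * interp + (1 - weight) * donor
    return result


def mae(true, pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(true, pred)) / len(true)


def rmse(true, pred):
    """Root mean squared error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(true, pred)) / len(true))
```

To evaluate a scheme like this in the way the abstract describes, one would mask known values at a chosen missing rate, impute them, and compare the imputed values with the held-out truth using `mae` and `rmse`. High-degree Lagrange polynomials can oscillate strongly, which is one reason this sketch keeps `k` small.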

