Alianso, Arifin Surya and Syafaah, Lailis and Faruq, Amrul (2022) K-nearest neighbor imputation for missing value in hepatitis data. In: AIP Conference Proceedings. AIP Publishing. ISBN 1551-7616
Alianso Syafaah Faruq - Missing Value Imputation Hepatitis KNN Machine Learning.pdf
Download (560kB) | Preview
Similarity - Alianso Syafaah Faruq - Missing Value Imputation Hepatitis KNN Machine Learning.pdf
Download (874kB) | Preview
Abstract
There has been a growing occurrence of errors in a dataset, one of which is the incomplete data on an attribute or commonly acknowledged as a missing value, affecting the results of an analysis conducted for researchers. Attempt to address such issue includes the imputation, a method of filling in the missing value by Replacing the missing value with a possible value based on dataset information. This study aims to deal with missing values in albumin attribute hepatic data by utilizing K-Nearest Neighbor (KNN) imputation, performed by calculating the weight mean estimation for the number of K which has been determined. K is thus the closest observation, where in this study, the K that would be utilized is when K=3, K=5, K=7, K=9, and K=15. To determine the accuracy of an imputation, an evaluation is performed by utilizing the Mean Square Error (MSE). Based on the results obtained in this study, the best accuracy of program calculations is obtained when K=7 and the best MSE is achieved when K=15.
Item Type: | Book Section / Proceedings |
---|---|
Keywords: | Missing Value, Imputation, Hepatitis, KNN Machine Learning |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science T Technology > TK Electrical engineering. Electronics Nuclear engineering |
Divisions: | Faculty of Engineering > Department of Electrical Engineering (20201) |
Depositing User: | faruq Amrul Faruq |
Date Deposited: | 23 Apr 2024 05:52 |
Last Modified: | 23 Apr 2024 05:52 |
URI: | https://eprints.umm.ac.id/id/eprint/5706 |