Fildzah, Lathifah (2024) Klasifikasi Risiko Penyakit Jantung Menggunakan Random Forest: Pengembangan Model Prediktif Melalui Hyperparameter Tuning. Undergraduate thesis, Universitas Muhammadiyah Malang.
PENDAHULUAN.pdf
Download (2MB) | Preview
BAB I.pdf
Restricted to Registered users only
Download (323kB) | Request a copy
BAB II.pdf
Restricted to Registered users only
Download (425kB) | Request a copy
BAB III.pdf
Restricted to Registered users only
Download (457kB) | Request a copy
BAB IV.pdf
Restricted to Registered users only
Download (386kB) | Request a copy
BAB V.pdf
Restricted to Registered users only
Download (297kB) | Request a copy
Abstract
The serious global health problem of heart disease places a heavy burden on the world's healthcare systems. Effective prevention and management depend on early identification and accurate risk assessment. In this study, the Random Forest algorithm and Grid Search CV hyperparameter tuning are used to anticipate the risk of developing heart disease. The main data source is the sizable "Heart Disease Dataset" from Kaggle. The goal is to increase the reliability and accuracy of the Random Forest model, enabling healthcare professionals to make defensible decisions based on consistent risk predictions. In order to manage missing values, remove unused columns, and standardize the dataset using Min Max scaling, data preparation is necessary. The initial model accuracy is 98.5%, however following hyperparameter tuning using a 5-fold cross-validated Grid Search CV, this accuracy rises noticeably to an amazing 100%. These findings demonstrate the Random Forest model's improved clinical decision-making capabilities and enhanced prediction efficacy. The study underlines how the Random Forest algorithm, together with hyperparameter tuning, has the potential to revolutionize the healthcare industry and increases our understanding of heart disease risk appraisal. Healthcare professionals may identify high-risk people and put the right preventative measures in place by providing a trustworthy, explicable prediction model. The study's ultimate goal is to lessen the worldwide burden of heart disease and improve the overall health effects for those who are impacted.
Item Type: | Thesis (Undergraduate) |
---|---|
Student ID: | 202010370311206 |
Keywords: | Heart disease, Random Forest, Grid Search CV, Machine Learning, Clinical information, Risk elements |
Subjects: | T Technology > T Technology (General) |
Divisions: | Faculty of Engineering > Department of Informatics (55201) |
Depositing User: | 202010370311206 fildzahlthfh |
Date Deposited: | 04 Apr 2024 04:59 |
Last Modified: | 04 Apr 2024 04:59 |
URI: | https://eprints.umm.ac.id/id/eprint/5441 |