Diabetes Detection Using Extreme Gradient Boosting (XGBoost) with Hyperparameter Tuning

Dinanthi, Devi Aprilya (2024) Diabetes Detection Using Extreme Gradient Boosting (XGBoost) with Hyperparameter Tuning. Undergraduate thesis, Universitas Muhammadiyah Malang.

[thumbnail of PENDAHULUAN.pdf]
Preview
Text
PENDAHULUAN.pdf

Download (406kB) | Preview
[thumbnail of BAB 1_Devi Aprilya Dinanthi.pdf]
Preview
Text
BAB 1_Devi Aprilya Dinanthi.pdf

Download (157kB) | Preview
[thumbnail of BAB II_Devi Aprilya Dinanthi.pdf]
Preview
Text
BAB II_Devi Aprilya Dinanthi.pdf

Download (168kB) | Preview
[thumbnail of BAB III_Devi Aprilya Dinanthi.pdf] Text
BAB III_Devi Aprilya Dinanthi.pdf
Restricted to Registered users only

Download (228kB) | Request a copy
[thumbnail of BAB IV_Devi Aprilya Dinanthi.pdf] Text
BAB IV_Devi Aprilya Dinanthi.pdf
Restricted to Registered users only

Download (168kB) | Request a copy
[thumbnail of BAB V_Devi Aprilya Dinanthi.pdf] Text
BAB V_Devi Aprilya Dinanthi.pdf
Restricted to Registered users only

Download (137kB) | Request a copy
[thumbnail of 351-Article Text-1924-1-10-20240728.pdf]
Preview
Text
351-Article Text-1924-1-10-20240728.pdf

Download (453kB) | Preview

Abstract

Diabetes is a serious condition that can lead to fatal complications and death due to metabolic disorders caused by a lack of insulin production in the body. This study aims to find the best classification performance on diabetes dataset using Extreme Gradient Boosting (XGBoost) method. The dataset used has 768 rows and 9 columns, with target values of 0 and 1. In this study, resampling is applied to overcome data imbalance using SMOTE, and hyperparameter optimization is performed using GridSearchCV and RandomSearchCV. Model evaluation was performed using confusion matrix as well as metrics such as accuracy, precision, recall, and F1-score. The test results show that the use of GridSearchCV and RandomSearchCV for hyperparameter tuning provides good results. The application of data resampling also managed to improve the overall model performance, especially in the XGBoost method that has been optimized using GridSearchCV, which achieved the highest accuracy of 85%, while XGBoost with RandomSearchCV optimization showed 83% accuracy performance.

Item Type: Thesis (Undergraduate)
Student ID: 202010370311297
Keywords: Diabetes, XGBoost, SMOTE, Hyperparameter Tuning, GridSearchCV,RandomSearchCV
Subjects: R Medicine > R Medicine (General)
T Technology > T Technology (General)
Divisions: Faculty of Engineering > Department of Informatics (55201)
Depositing User: 202010370311297 dinanthi140402
Date Deposited: 29 Jul 2024 07:33
Last Modified: 31 Jul 2024 08:10
URI: https://eprints.umm.ac.id/id/eprint/8911

Actions (login required)

View Item
View Item