Implementation of Feature Selection Strategies to Enhance Classification Using XGBoost and Decision Tree

Nadya, Fhara Elvina Pingky and Ferdiansyah, M. Firdaus Ibadi and Nastiti, Vinna Rahmayanti Setyaning and Aditya, Christian Sri Kusuma (2024) Implementation of Feature Selection Strategies to Enhance Classification Using XGBoost and Decision Tree. Scientific Journal of Informatics, 11 (1). pp. 187-194. ISSN p-ISSN 2407-7658 e-ISSN 2460-0040

Abstract

Grades are often used in education as the benchmark for whether a student has succeeded during a learning period. Yet schools that provide the same facilities and teaching staff to all students do not produce uniform grades; a gap in grades is still found in every school. The purpose of this research is to achieve higher classification accuracy by applying four feature selection methods, Information Gain (IG), Recursive Feature Elimination (RFE), Lasso, and a Hybrid method (RFE + Mutual Information), with XGBoost and Decision Tree models. Methods: The study used data on 649 students in a Portuguese course. The data were pre-processed as required, feature selection was then applied to choose the features that affect the target, the selected data were classified using XGBoost and Decision Tree, and finally the results were evaluated and reported. Results: Information Gain combined with the XGBoost algorithm achieved the best accuracy of all the combinations tested, 81.53%. Novelty: This research improves on the classification accuracy of previous research by using two traditional machine learning algorithms and several feature selection methods.
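
As an illustration of the Information Gain step named in the abstract, the following is a minimal sketch of ranking categorical features by information gain using only the Python standard library. It is not the authors' implementation: the function names (entropy, information_gain, top_k_features), the feature names, and the six-row toy dataset are all assumptions made up for this example, and the classification step with XGBoost or a Decision Tree is left out.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(feature, labels):
    """Entropy of the labels minus the entropy left after splitting on the feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def top_k_features(columns, labels, k):
    """Return the names of the k features with the highest information gain."""
    ranked = sorted(
        columns,
        key=lambda name: information_gain(columns[name], labels),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical toy data, NOT the 649-student dataset used in the paper.
columns = {
    "studytime": ["low", "low", "high", "high", "high", "low"],
    "internet": ["yes", "no", "yes", "no", "yes", "no"],
}
labels = ["fail", "fail", "pass", "pass", "pass", "fail"]

# Keep only the single most informative feature.
selected = top_k_features(columns, labels, k=1)
print(selected)
```

In this toy data the hypothetical "studytime" column separates the two classes perfectly, so it is ranked above "internet". In a full pipeline, the selected columns would then be passed to the classifier, and the choice of k would be tuned against held-out accuracy.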

Item Type: Article
Keywords: Grade; Decision tree; Feature selection; XGBoost
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Engineering > Department of Informatics (55201)
Depositing User: christianskaditya Christian Sri Kusuma Aditya, S.Kom., M.Kom
Date Deposited: 03 May 2024 04:37
Last Modified: 03 May 2024 04:37
URI: https://eprints.umm.ac.id/id/eprint/6074
