Giving more insight for automatic risk prediction during pregnancy with interpretable machine learning

Irfan, Muhammad and basuki, setio and Azhar, Yufis (2021) Giving more insight for automatic risk prediction during pregnancy with interpretable machine learning. Bulletin of Electrical Engineering and Informatics (BEEI), 10 (3). pp. 1621-1633. ISSN 2302-9285

[thumbnail of irfan basuki azhar - c5.0 algorithm correlation-based feature selection interpretability maternal mortality rate poedji rochjati screening card risk of pr.pdf]
Preview
Text
irfan basuki azhar - c5.0 algorithm correlation-based feature selection interpretability maternal mortality rate poedji rochjati screening card risk of pr.pdf

Download (1MB) | Preview
[thumbnail of similarity - irfan basuki azhar - c5.0 algorithm correlation-based feature selection interpretability maternal mortality rate poedji rochjati screening ca.pdf]
Preview
Text
similarity - irfan basuki azhar - c5.0 algorithm correlation-based feature selection interpretability maternal mortality rate poedji rochjati screening ca.pdf

Download (2MB) | Preview

Abstract

Maternal mortality rate (MMR) in Indonesia intercensal population survey
(SUPAS) was considered high. For pregnancy risk detection, the public health center (puskesmas) applies a Poedji Rochjati screening card (KSPR) demonstrating 20 features. In addition to KSPR, pregnancy risk monitoring
has been assisted with a pregnancy control card. Because of the differences
in the number of features between the two control cards, it is necessary to
make agreements between them. Our objectives are determining the most
influential features, exploring the links among features on the KSPR and
pregnancy control cards, and building a machine learning model for predicting pregnancy risk. For the first objective, we use correlation-based feature selection (CFS) and C5.0 algorithm. The next objective was answered by the union operation in the features produced by the two techniques. By performing the machine learning experiment on these features, the accuracy of the XGBoost algorithm demonstrated the hightest results of 94% followed by random forest, Naïve Bayes, and k-Nearest neighbor algorithms, 87%, 66%, and 60% respectively. Interpretability aspects are implemented with SHAP and LIME to provide more insight for classification model. In conclusion, the similarity feature generated in the two interpretation approaches confirmed that Cesar was dominant in determining pregnancy risk.

Item Type: Article
Keywords: C5.0 algorithm, Correlation-based feature, selection, Interpretability, Maternal mortality rate, Poedji Rochjati screening card, Risk of pregnancy
Subjects: Q Science > Q Science (General)
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76 Computer software
R Medicine > R Medicine (General)
T Technology > T Technology (General)
Divisions: Faculty of Engineering > Department of Informatics (55201)
Depositing User: maulana Maulana Chairudin
Date Deposited: 09 Mar 2024 01:25
Last Modified: 09 Mar 2024 01:25
URI: https://eprints.umm.ac.id/id/eprint/4600

Actions (login required)

View Item
View Item