Analisis Performa Teks Augmentasi Pada Sentimen Kebocoran Data Bank Syariah Indonesia Menggunakan IndoBERT

Tamam, Rosydan Amru (2025) Analisis Performa Teks Augmentasi Pada Sentimen Kebocoran Data Bank Syariah Indonesia Menggunakan IndoBERT. Undergraduate thesis, Universitas Muhammadiyah Malang.

[thumbnail of PENDAHULUAN.pdf]
Preview
Text
PENDAHULUAN.pdf

Download (1MB) | Preview
[thumbnail of BAB I.pdf]
Preview
Text
BAB I.pdf

Download (326kB) | Preview
[thumbnail of BAB II.pdf]
Preview
Text
BAB II.pdf

Download (306kB) | Preview
[thumbnail of BAB III.pdf] Text
BAB III.pdf
Restricted to Registered users only

Download (481kB) | Request a copy
[thumbnail of BAB IV.pdf] Text
BAB IV.pdf
Restricted to Registered users only

Download (697kB) | Request a copy
[thumbnail of BAB V.pdf] Text
BAB V.pdf
Restricted to Registered users only

Download (314kB) | Request a copy
[thumbnail of POSTER.pdf] Text
POSTER.pdf
Restricted to Registered users only

Download (241kB) | Request a copy

Abstract

The data leak that occurred at Bank Syariah Indonesia (BSI) in May 2023 triggered a variety of public opinions spread on X social media. This study aims to classify public sentiment towards the incident and analyze the effect of various data augmentation techniques on the performance of the IndoBERT model. The dataset used consisted of 24,000 tweets and was preprocessed and sentiment labeled automatically. The data imbalance problem is solved by four augmentation techniques, namely SMOTE, synonym replacement, back translation, and a combination of synonym replacement and back translation. Each augmented dataset is trained using the IndoBERT model and evaluated based on accuracy, precision, recall, and f1-score. The evaluation results show that the SMOTE technique provides the best performance with an accuracy of 87%, outperforming the other augmentation techniques. This is because SMOTE works on numerical representations without causing meaning distortion, making it more stable in supporting the performance of the IndoBERT model. This research contributes to the selection of effective augmentation methods for sentiment analysis in the context of unbalanced data in Indonesian.

Item Type: Thesis (Undergraduate)
Student ID: 202110370311249
Keywords: Sentiment Analysis, Data Leak, Bank Syariah Indonesia, Text Augmentation, IndoBERT
Subjects: T Technology > T Technology (General)
Divisions: Faculty of Engineering > Department of Informatics (55201)
Depositing User: 202110370311249 rosydan_amru
Date Deposited: 30 Jul 2025 07:23
Last Modified: 30 Jul 2025 07:23
URI: https://eprints.umm.ac.id/id/eprint/20799

Actions (login required)

View Item
View Item