Analisis Sentimen terhadap Kebijakan Naturalisasi Timnas Indonesia Menggunakan IndoBERT dengan Perbandingan Augmentasi WordNet dan SMOTE

Adani, Dimas Dzaki (2025) Analisis Sentimen terhadap Kebijakan Naturalisasi Timnas Indonesia Menggunakan IndoBERT dengan Perbandingan Augmentasi WordNet dan SMOTE. Undergraduate thesis, Universitas Muhammadiyah Malang.

[thumbnail of PENDAHULUAN.pdf]
Preview
Text
PENDAHULUAN.pdf

Download (674kB) | Preview
[thumbnail of BAB I.pdf]
Preview
Text
BAB I.pdf

Download (224kB) | Preview
[thumbnail of BAB II.pdf]
Preview
Text
BAB II.pdf

Download (212kB) | Preview
[thumbnail of BAB III.pdf] Text
BAB III.pdf
Restricted to Registered users only

Download (548kB) | Request a copy
[thumbnail of BAB IV.pdf] Text
BAB IV.pdf
Restricted to Registered users only

Download (728kB) | Request a copy
[thumbnail of BAB V.pdf] Text
BAB V.pdf
Restricted to Registered users only

Download (172kB) | Request a copy
[thumbnail of POSTER.pdf] Text
POSTER.pdf
Restricted to Registered users only

Download (262kB) | Request a copy

Abstract

The naturalization of players for the Indonesian National Football Team has sparked diverse public opinions on social media platform X (Twitter), ranging from support for improving team quality to concerns about the marginalization of local players. To analyze public sentiment more comprehensively, this study employs the IndoBERT model with two data balancing approaches: WordNet-based augmentation (synonym replacement and random deletion) and the Synthetic Minority Oversampling Technique (SMOTE). A total of 7,866 tweets were collected through a crawling process using the keyword “naturalisasi Timnas Indonesia” (“Indonesian national team naturalization”). The results indicate that both balancing methods successfully improved the model’s performance, particularly for the Neutral class, which previously had the lowest F1-score. The WordNet + IndoBERT model achieved an accuracy of 83% with a macro average F1-score of 0.77, while SMOTE + IndoBERT obtained an accuracy of 81% and a macro average F1-score of 0.75. These findings suggest that both methods are effective in reducing bias toward the majority class, with WordNet performing slightly better due to its ability to enrich the semantic meaning of text. However, the model still faces challenges in distinguishing Neutral and Positive sentiments, likely due to the linguistic similarities and diverse expressions of public opinion on social media.

Item Type: Thesis (Undergraduate)
Student ID: 202110370311003
Keywords: Sentiment Analysis, Naturalization, IndoBERT, WordNet, SMOTE, Social Media, Twitter
Subjects: T Technology > T Technology (General)
Divisions: Faculty of Engineering > Department of Informatics (55201)
Depositing User: 202110370311003 dimasdzaki05
Date Deposited: 03 Nov 2025 04:09
Last Modified: 03 Nov 2025 04:09
URI: https://eprints.umm.ac.id/id/eprint/24425

Actions (login required)

View Item
View Item