Implementasi Algoritma Topic Modeling pada Data Cuitan Terkait Kesehatan Mental pada Platform Media Sosial Twitter (X) Berbahasa Indonesia

Effendy, Nico (2024) Implementasi Algoritma Topic Modeling pada Data Cuitan Terkait Kesehatan Mental pada Platform Media Sosial Twitter (X) Berbahasa Indonesia. Undergraduate thesis, Universitas Muhammadiyah Malang.

[thumbnail of PENDAHULUAN.pdf]
Preview
Text
PENDAHULUAN.pdf

Download (1MB) | Preview
[thumbnail of BAB I.pdf]
Preview
Text
BAB I.pdf

Download (223kB) | Preview
[thumbnail of BAB II.pdf]
Preview
Text
BAB II.pdf

Download (268kB) | Preview
[thumbnail of BAB III.pdf] Text
BAB III.pdf
Restricted to Registered users only

Download (303kB) | Request a copy
[thumbnail of BAB IV.pdf] Text
BAB IV.pdf
Restricted to Registered users only

Download (836kB) | Request a copy
[thumbnail of BAB V.pdf] Text
BAB V.pdf
Restricted to Registered users only

Download (86kB) | Request a copy
[thumbnail of POSTER.png] Image
POSTER.png
Restricted to Registered users only

Download (599kB) | Request a copy

Abstract

This research aims to implement topic modeling algorithms on mental health text tweet data, reviewing the literature on this topic reveals that previous research has a narrow focus, causing many issues to be overlooked. This paper addresses these issues by applying topic modeling methods across various mental health frames through several stages. First, the paper examines various mental health frames resulting in 7 main labels: 'Awareness', 'Feelings and Problematization', 'Classification', 'Accessibility and Funding', 'Stigma', 'Service', and 'Youth'. The second stage focused on compiling a dataset of 29,068 Indonesian tweets, by filtering tweets using the keywords “mental health” and “mental health”. In the third stage, this research conducted data preprocessing and manual labeling of 3,828 randomly selected tweets, which were chosen due to the impracticality of labeling the entire data and so that each randomly drawn data has an equal chance of not referring to any one label. Finally, in the fourth stage, topic modeling experiments were conducted involving two scenarios using three different algorithms. Evaluation was done using human interpretation and coherence score. The results of the topic modeling experiments show that the LDA and LSI algorithms are more effective than HDP in generating keywords relevant to each main label. LDA shows an increase in coherence as the number of clusters increases up to a certain limit. LSI shows mixed coherence performance depending on the number of clusters, while HDP has relatively stable coherence from 3 to 7 clusters.

Item Type: Thesis (Undergraduate)
Student ID: 202010370311467
Keywords: Mental Health, Topic Modeling, LDA, LSI, HDP
Subjects: T Technology > T Technology (General)
Divisions: Faculty of Engineering > Department of Informatics (55201)
Depositing User: 202010370311467 nicoardiaeffendy23
Date Deposited: 28 Oct 2024 02:37
Last Modified: 28 Oct 2024 02:37
URI: https://eprints.umm.ac.id/id/eprint/11831

Actions (login required)

View Item
View Item