UMM Institutional Repository

EKSTRAKSI INFORMASI KESEHATAN MASYARAKAT DARI TWEET BERBAHASA INDONESIA BERBASIS KLASIFIKASI DENGAN ALGORITMA NAIVE BAYES

Rosikin, Khoirir (2018) EKSTRAKSI INFORMASI KESEHATAN MASYARAKAT DARI TWEET BERBAHASA INDONESIA BERBASIS KLASIFIKASI DENGAN ALGORITMA NAIVE BAYES. Bachelors Degree (S1) thesis, University of Muhammadiyah Malang.

[img]
Preview
Text
Pendahuluan.pdf

Download (1MB) | Preview
[img]
Preview
Text
BAB 1.pdf

Download (347kB) | Preview
[img]
Preview
Text
BAB 2.pdf

Download (333kB) | Preview
[img]
Preview
Text
BAB 3.pdf

Download (683kB) | Preview
[img] Text
BAB 4.pdf
Restricted to Registered users only

Download (1MB)
[img] Text
BAB 5.pdf
Restricted to Registered users only

Download (125kB)

Abstract

Health is a primary human need. In Indonesia there are health problems, namely the increase of infectious diseases and non-communicable diseases. To overcome this need to do precautionary measures. One effort to prevent disease, is to know the disease information, including about the causes and effects caused, so it can do prevention. Information can be obtained in various ways, one of which is taken from social media, especially twitter. Twitter is used because of the number of tweets produced resulting in big data phenomenon. Because of that, this research intends to perform an information extraction method. Information extraction is a method of application of data mining, especially the text mining field used to obtain information from a large collection of data. The information in question is a disease, effect, and cause. This research uses a classification-based information extraction approach with Naive Bayes algorithm. This research uses 7 feature sets and a model of classification algorithm that is Naive Bayes. In feature extraction there is imbalance dataset, so it is done resample filtering data. The test is done by 2 methods, namely model testing using 10-folds cross-validation and classification testing using 100 test data. The result of model test get the accuracy value 77,27% and the classification test get the accuracy value 74,16%.

Item Type: Thesis (Bachelors Degree (S1))
Student ID: 201310370311051
Keywords: Information Extraction, Classification, Naive Bayes, Natural Language Processing
Subjects: Z Bibliography. Library Science. Information Resources > ZA Information resources
Divisions: Faculty of Engineering > Department of Informatics (55201)
Depositing User: Sulistyaningsih Sulistyaningsih
Date Deposited: 17 Nov 2018 01:45
Last Modified: 17 Nov 2018 01:45
URI : http://eprints.umm.ac.id/id/eprint/40233

Actions (login required)

View Item View Item
UMM Official

© 2008 UMM Library. All Rights Reserved.
Jl. Raya Tlogomas No 246 Malang East Java Indonesia - Phone +62341464318 ext. 150, 151 - Fax +62341464101
E-Mail : infopus[at]umm.ac.id - Website : http://lib.umm.ac.id - Online Catalog : http://laser.umm.ac.id - Repository : http://eprints.umm.ac.id

Web Analytics

UMM Institutional Repository is powered by :
EPrints Logo