Automatic Question Generation for 5W-1H Open Domain of Indonesian Questions by Using Syntactical Template-Based Features from Academic Textbooks

basuki, setio and Kusuma, Selvia Ferdiana (2018) Automatic Question Generation for 5W-1H Open Domain of Indonesian Questions by Using Syntactical Template-Based Features from Academic Textbooks. Journal of Theoretical and Applied Information Technology (JATIT), 96 (12). pp. 3908-3923. ISSN 1817-3195

[thumbnail of basuki kusuma - automatic question generation coarse-class classification fine-class classification open domain question syntactical approach.pdf]
Preview
Text
basuki kusuma - automatic question generation coarse-class classification fine-class classification open domain question syntactical approach.pdf

Download (326kB) | Preview
[thumbnail of similarity - basuki kusuma - automatic question generation coarse-class classification fine-class classification open domain question syntactical approach.pdf]
Preview
Text
similarity - basuki kusuma - automatic question generation coarse-class classification fine-class classification open domain question syntactical approach.pdf

Download (4MB) | Preview

Abstract

The measuring of education quality in school can be conducted by delivering the examination to the students. Composing questions in the examination process to measure students’ achievement in the school teaching and learning process can be difficult and time consuming. To solve this problem, this research proposes Automatic Question Generation (AQG) method to generate Open Domain Indonesian Question by using syntactical approach. Open Domain questions are questions covering many domains of knowledge.
The challenge of generating the questions is how to identify the types of declarative sentences that are potential to be transformed into questions and how to develop the method for generating question automatically. In realizing the method, this research incorporates four stages, namely: the identification of declarative sentence for 8 coarse-class and 19 fine-class sentences, the classification of features for coarseclass sentence and the classification rules for fine-class sentence, the identification of question patterns, and the extraction of sentence’s components as well as the rule generation of questions. The coarse-class classification was carried out based on a machine learning with syntactical features of the sentence, namely: Part of Speech (POS) Tag, the presence of punctuation, the availability of specific verbs, sequence of words, etc. The fine-class classification was carried out based on a set of rules. According to the implementation and experiment, the findings show that the accuracy of coarse-class classification reaches 83.26% by using the SMO classifier and the accuracy of proposed fine-class classification reaches 92%. The generated questions are categorized into three types, namely: TRUE, UNDERSTANDABLE, and FALSE. The accuracy of generated TRUE and UNDERSTANDABLE questions reaches 88.66%. Thus, the obtained results show that the proposed method is prospective to implement in the real situation.

Item Type: Article
Keywords: Automatic Question Generation (AQG), Coarse-class Classification, Fine-class Classification, Open Domain Question, Syntactical Approach
Subjects: Q Science > Q Science (General)
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76 Computer software
Divisions: Faculty of Engineering > Department of Informatics (55201)
Depositing User: maulana Maulana Chairudin
Date Deposited: 09 Mar 2024 01:08
Last Modified: 09 Mar 2024 01:08
URI: https://eprints.umm.ac.id/id/eprint/4598

Actions (login required)

View Item
View Item