Pengembangan Large Language Model untuk Analisis Tindak Pidana Perdagangan Orang di Indonesia

Anhari, Nizam Avif (2026) Pengembangan Large Language Model untuk Analisis Tindak Pidana Perdagangan Orang di Indonesia. Undergraduate thesis, Universitas Muhammadiyah Malang.

[thumbnail of PENDAHULUAN.pdf]
Preview
Text
PENDAHULUAN.pdf

Download (15MB) | Preview
[thumbnail of BAB I.pdf]
Preview
Text
BAB I.pdf

Download (6MB) | Preview
[thumbnail of BAB II.pdf]
Preview
Text
BAB II.pdf

Download (6MB) | Preview
[thumbnail of BAB III.pdf] Text
BAB III.pdf
Restricted to Registered users only

Download (6MB) | Request a copy
[thumbnail of BAB IV.pdf] Text
BAB IV.pdf
Restricted to Registered users only

Download (6MB) | Request a copy
[thumbnail of BAB V.pdf] Text
BAB V.pdf
Restricted to Registered users only

Download (6MB) | Request a copy
[thumbnail of LAMPIRAN.pdf] Text
LAMPIRAN.pdf
Restricted to Registered users only

Download (59kB) | Request a copy
[thumbnail of POSTER.pdf]
Preview
Text
POSTER.pdf

Download (258kB) | Preview

Abstract

The development of Large Language Models (LLMs) has demonstrated significant capabilities in understanding and generating text, but their application in the legal domain still faces serious challenges such as fact hallucination, linguistic bias, and low suitability for local jurisprudence structures. This research aims to develop a Legal-Case LLM, an open-weight large language model that is specifically fine-tuned for the analysis of jurisprudence on human trafficking in Indonesia. The dataset was compiled from more than 400 court decisions obtained from the Supreme Court Decision Directory through a process of scraping, document validation, structured metadata extraction, and the construction of synthetic question-and-answer (Q&A) data that was cleaned and validated in stages. The base model based on a decoder-only architecture (Gemma 3) was adapted using parameter-efficient fine-tuning (LoRA) techniques to improve training efficiency and domain suitability. The evaluation was conducted using automated metrics including ROUGE, BLEU, BERTScore, and BARTScore, as well as qualitative analysis of legal terminology alignment and hallucination tendencies. The results show that the fine-tuned model experienced a significant improvement in content recall, semantic alignment, and the use of legal terminology such as “court decision,” “panel of judges,” and “legal considerations” compared to the zero-shot baseline. In addition, the model was able to reduce factual errors in related outputs.

Item Type: Thesis (Undergraduate)
Student ID: 202210370311071
Keywords: Legal LLM, Human Trafficking Crimes, Indonesian Jurisprudence, Fine-tuning, Gemma, Artificial Intelligence in Law
Subjects: K Law > K Law (General)
Q Science > Q Science (General)
T Technology > T Technology (General)
Divisions: Faculty of Engineering > Department of Informatics (55201)
Depositing User: 202210370311071 nizamanhari
Date Deposited: 11 May 2026 04:47
Last Modified: 11 May 2026 04:47
URI: https://eprints.umm.ac.id/id/eprint/29793

Actions (login required)

View Item
View Item