Show simple item record

dc.contributor.advisor  Lydia, Maya Silvi
dc.contributor.advisor  Amalia
dc.contributor.author  Wijaya, Filbert
dc.date.accessioned  2025-07-16T03:48:57Z
dc.date.available  2025-07-16T03:48:57Z
dc.date.issued  2025
dc.identifier.uri  https://repositori.usu.ac.id/handle/123456789/105554
dc.description.abstract  Hokkien is a dialect within the Southern Min language group and one of the low-resource languages (LRLs). Natural language processing (NLP) tasks such as neural machine translation (NMT) for LRLs can apply fine-tuning to mitigate the scarcity of parallel corpora. A previous study that fine-tuned a large language model (LLM) for Hokkien translation did not support Indonesian; this study therefore fine-tunes an LLM to develop a model that supports Hokkien-Indonesian translation. The method used is quantized low-rank adaptation (QLoRA) fine-tuning, which requires far less computing power than full fine-tuning. The LLM used is Taigi-Llama-2-Translator-7B, a LLaMA 2-based model with 7 billion parameters. The datasets consist of dictionary, terminology, news, and transcription texts from the Taiwan Ministry of Education, Hugging Face, and National Yang Ming Chiao Tung University (NYCU). Model performance before and after fine-tuning is evaluated with BLEU and chrF++. The results show that the model supports Indonesian translation after fine-tuning, with performance gains on Hokkien-Indonesian translation: the average BLEU score rises from 0.01 to 0.16 and the average chrF++ score from 3.48 to 23.77.  en_US
dc.language.iso  id  en_US
dc.publisher  Universitas Sumatera Utara  en_US
dc.subject  Hokkien  en_US
dc.subject  Indonesian  en_US
dc.subject  Low-Resource Language  en_US
dc.subject  Large Language Model  en_US
dc.subject  Neural Machine Translation  en_US
dc.subject  Fine-Tuning  en_US
dc.subject  Quantized Low-Rank Adaptation  en_US
dc.title  Mesin Penerjemah Bahasa Hokkien-Indonesia dengan Fine-Tuning Quantized Low-Rank Adaptation (QLoRA) pada Large Language Model Meta AI (LLaMA)  en_US
dc.title.alternative  Hokkien-Indonesian Translator with Quantized Low-Rank Adaptation (QLoRA) Fine-Tuning on Large Language Model Meta AI (LLaMA)  en_US
dc.type  Thesis  en_US
dc.identifier.nim  NIM211401045
dc.identifier.nidn  NIDN0027017403
dc.identifier.nidn  NIDN0121127801
dc.identifier.kodeprodi  KODEPRODI55201#Ilmu Komputer
dc.description.pages  73 Pages  en_US
dc.description.type  Skripsi Sarjana  en_US
dc.subject.sdgs  SDGs 4. Quality Education  en_US
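The abstract reports evaluation with BLEU and chrF++ before and after QLoRA fine-tuning. As a rough illustration of what the character-level part of chrF measures, here is a minimal pure-Python sketch of a character n-gram F-beta score. This is not the implementation used in the thesis (a standard toolkit such as sacrebleu would normally be used); the function and parameter names are hypothetical, and real chrF++ additionally mixes in word 1- and 2-gram scores.

```python
from collections import Counter

def char_ngrams(text, n):
    # Character n-grams with whitespace removed, as in common chrF setups.
    s = text.replace(" ", "")
    return Counter(s[i:i + n] for i in range(len(s) - n + 1))

def chrf_sketch(hyp, ref, max_n=6, beta=2.0):
    # Illustrative sketch only: average character n-gram precision and
    # recall over orders 1..max_n, then combine with an F-beta score
    # (beta=2 weights recall twice as much as precision, as in chrF).
    precisions, recalls = [], []
    for n in range(1, max_n + 1):
        h, r = char_ngrams(hyp, n), char_ngrams(ref, n)
        if not h and not r:
            continue  # neither string has n-grams of this order
        overlap = sum((h & r).values())  # clipped n-gram matches
        precisions.append(overlap / max(sum(h.values()), 1))
        recalls.append(overlap / max(sum(r.values()), 1))
    if not precisions:
        return 0.0
    p = sum(precisions) / len(precisions)
    rc = sum(recalls) / len(recalls)
    if p + rc == 0:
        return 0.0
    return 100 * (1 + beta ** 2) * p * rc / (beta ** 2 * p + rc)
```

An identical hypothesis and reference score 100, disjoint strings score 0, and partial overlaps fall in between, which is the intuition behind the reported chrF++ improvement from 3.48 to 23.77.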

