Pembangkitan Anotasi Peta Berdasarkan Ekstraksi Data Berita Penyakit Endemik Menggunakan Pendekatan Natural Language Generation

Lianto, Jefry

Pembangkitan Anotasi Peta Berdasarkan Ekstraksi Data Berita Penyakit Endemik Menggunakan Pendekatan Natural Language Generation

dc.contributor.advisor	Arisandi, Dedy
dc.contributor.advisor	Jaya, Ivan
dc.contributor.author	Lianto, Jefry
dc.date.accessioned	2025-01-08T08:38:56Z
dc.date.available	2025-01-08T08:38:56Z
dc.date.issued	2025
dc.identifier.uri	https://repositori.usu.ac.id/handle/123456789/99943
dc.description.abstract	In the digital era, news data serves as an abundant information source but is often unstructured, particularly regarding the spread of endemic diseases. This study aims to implement automatic data extraction from news articles about endemic diseases to generate map annotations using a Natural Language Generation (NLG) approach. The developed system utilizes scraping techniques to gather data from online news articles, which are then processed through summarization using the pre-trained BART model and a frequency-based method. The text preprocessing steps include case folding, tokenization, and stopword removal. The extracted data is used to identify geographic locations and types of diseases mentioned, which are then annotated onto maps for visualization in a Geographic Information System (GIS). The evaluation was conducted using multiple metrics, including BLEU, ROUGE, and BERTScore, showing strong performance with average accuracies. The average BERTScore F1 was 0.88, BLEU was 0.78, ROUGE-1 was 0.84, ROUGE-2 was 0.80, and ROUGE-L was 0.84, indicating high consistency between the summaries and the original texts. Additionally, an evaluation involving three groups of respondents (general public, medical professionals, and linguists) revealed that 80% found the summaries easy to understand, 75% found them clear, and 70% found them easy to read. These findings demonstrate that the NLG approach effectively generates informative news summaries and accurate map annotations to facilitate monitoring and managing the spread of endemic diseases in Indonesia.	en_US
dc.language.iso	id	en_US
dc.publisher	Universitas Sumatera Utara	en_US
dc.subject	Natural Language Generation	en_US
dc.subject	Map Annotations	en_US
dc.subject	News Data Extraction	en_US
dc.subject	Bidirectional and Auto-Regressive Transformers	en_US
dc.subject	Frequency-Based Method	en_US
dc.title	Pembangkitan Anotasi Peta Berdasarkan Ekstraksi Data Berita Penyakit Endemik Menggunakan Pendekatan Natural Language Generation	en_US
dc.title.alternative	Generation of Map Annotations Based on Extraction of Endemic Disease News Data Using Natural Language Generation Approach	en_US
dc.type	Thesis	en_US
dc.identifier.nim	NIM191402027
dc.identifier.nidn	NIDN0031087905
dc.identifier.nidn	NIDN0107078404
dc.identifier.kodeprodi	KODEPRODI59201#Teknologi Informasi
dc.description.pages	96 Pages	en_US
dc.description.type	Skripsi Sarjana	en_US
dc.subject.sdgs	SDGs 4. Quality Education	en_US

Files in this item

Name:: Pembangkitan Anotasi Peta ...
Size:: 370.2Kb
Format:: PDF
Description:: Cover

View/Open

Name:: Jefry Lianto_Pembangkitan Anotasi ...
Size:: 1.889Mb
Format:: PDF
Description:: Fulltext

View/Open

This item appears in the following Collection(s)

Undergraduate Theses [883]
Skripsi Sarjana

Show simple item record