Show simple item record

dc.contributor.advisorMahyuddin
dc.contributor.advisorAmalia
dc.contributor.authorLubis, Hasby Sahendri
dc.date.accessioned2024-09-06T09:21:43Z
dc.date.available2024-09-06T09:21:43Z
dc.date.issued2024
dc.identifier.urihttps://repositori.usu.ac.id/handle/123456789/96949
dc.description.abstractTerm Frequency-Inverse Document Frequency (TF-IDF) is used to assess the importance of words in a document relative to the rest of the document set, while K-Means clusters documents based on content similarity. Utilizing a text dataset covering various government services, this study measures the effectiveness of these methods in identifying and clustering these services. Text pre-processing reduced the number of words from 30,753 to 15,783, indicating the elimination of irrelevant words. Visualization of the TF-IDF scatter plot shows a negative relationship between the frequency of occurrence of a word (TF) and its uniqueness (IDF). Clustering performance evaluation was performed using Silhouette Index (SI) and Davies Bouldin Index (DBI), which showed the consistency and good quality of the generated clusters. A stable SI value of about 0.620 and a consistent DBI value of about 0.551 indicate that the K-Means algorithm, both with the Euclidean and Manhattan approaches, is effective in grouping comments into clusters representing negative, neutral, and positive sentiments. The results of this research make a significant contribution to the development of information systems that are more efficient and responsive to public needs, as well as strengthening text data management in the context of government.en_US
dc.language.isoiden_US
dc.publisherUniversitas Sumatera Utaraen_US
dc.subjectDavies Bouldin Indexen_US
dc.subjectK-Means Clusteringen_US
dc.subjectPublic Servicesen_US
dc.subjectSilhouette Indexen_US
dc.subjectTF-IDFen_US
dc.subjectSDGsen_US
dc.titleUnjuk Kerja Term Frequency – Inverse Document Frequency dan K-Means dalam Identifikasi Layanan Pemerintahen_US
dc.title.alternativePerformance of Term Frequency – Inverse Document Frequency and K-Means in Government Service Identificationen_US
dc.typeThesisen_US
dc.identifier.nimNIM217038028
dc.identifier.nidnNIDN0025126703
dc.identifier.nidnNIDN0121127801
dc.identifier.kodeprodiKODEPRODI55101#Teknik Informatika
dc.description.pages128 Pagesen_US
dc.description.typeTesis Magisteren_US


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record