Pendeteksi Kemiripan Teks Paragraf dalam Dokumen Menggunakan Algoritma Leacock Chodorow dan Cosine Similarity
dc.contributor.advisor | Sitompul, Opim Salim | |
dc.contributor.advisor | Nababan, Erna Budhiarti | |
dc.contributor.author | Choales, Aldrich William | |
dc.date.accessioned | 2024-02-15T07:39:27Z | |
dc.date.available | 2024-02-15T07:39:27Z | |
dc.date.issued | 2024 | |
dc.identifier.uri | https://repositori.usu.ac.id/handle/123456789/91273 | |
dc.description.abstract | Plagiarism is the act of closely copying or imitating, taking works from authors or creators without prior permission, with the intention of claiming the work as one's own original creation. Plagiarism is a common issue in both general and academic settings. In the academic context, plagiarism poses a significant problem because verifying the authenticity of a document is time-consuming and varies in terms of precision. Therefore, an approach need to be to developed aimed at detecting paragraph similarity in documents. This research aims to address the issue by automatically detecting similarity in the content of scholarly documents based on paragraphs, using the Leacock Chodorow algorithm and cosine similarity. In the system testing, 28 test data and reference data are used, with 7 test documents consisting of a total of 177 test paragraphs and 21 reference documents consisting of a total of 438 reference paragraphs. The evaluation results of the paragraph similarity detection system obtained an accuracy of 0.923 or 92.3%, precision of 0.908 or 90.8%, recall of 0.953 or 95.3%, and an F-measure of 0.930 or 93%. | en_US |
dc.language.iso | id | en_US |
dc.publisher | Universitas Sumatera Utara | en_US |
dc.subject | Text Similarity | en_US |
dc.subject | Leacock Chodorow Similarity | en_US |
dc.subject | Cosine Similarity | en_US |
dc.subject | Plagiarism | en_US |
dc.subject | SDGs | en_US |
dc.title | Pendeteksi Kemiripan Teks Paragraf dalam Dokumen Menggunakan Algoritma Leacock Chodorow dan Cosine Similarity | en_US |
dc.type | Thesis | en_US |
dc.identifier.nim | NIM181402074 | |
dc.identifier.nidn | NIDN0017086108 | |
dc.identifier.nidn | NIDN0026106209 | |
dc.identifier.kodeprodi | KODEPRODI59201#Teknologi Informasi | |
dc.description.pages | 74 Halaman | en_US |
dc.description.type | Skripsi Sarjana | en_US |
Files in this item
This item appears in the following Collection(s)
-
Undergraduate Theses [765]
Skripsi Sarjana