
dc.contributor.advisor: Arisandi, Dedy
dc.contributor.advisor: Purnamawati, Sarah
dc.contributor.author: Sihombing, Johansen
dc.date.accessioned: 2025-07-11T09:01:14Z
dc.date.available: 2025-07-11T09:01:14Z
dc.date.issued: 2025
dc.identifier.uri: https://repositori.usu.ac.id/handle/123456789/105307
dc.description.abstract: Word complexity in English texts poses a significant challenge in the field of Natural Language Processing (NLP), particularly for the development of automatic text simplification systems and effective second language learning support tools. Language learners' comprehension is often hindered by highly complex words. This study aims to develop and evaluate an English word complexity prediction system using DeBERTa (Decoding-enhanced BERT with Disentangled Attention), a Transformer model renowned for its superior contextual representation. The model was trained and tested on a dataset comprising 8,554 word entries, compiled from the Complex dataset and augmented with data from the Oxford Dictionary. Evaluation results demonstrated excellent predictive performance, achieving a Mean Squared Error (MSE) of 0.0036, a Mean Absolute Error (MAE) of 0.0402, and a Pearson correlation of 0.9770 on the test set. These findings indicate that the DeBERTa model possesses high accuracy and robust generalization capabilities in assessing word complexity across diverse text domains, highlighting its significant potential for advancing NLP applications concerned with word complexity analysis and processing. [en_US]
dc.language.iso: id [en_US]
dc.publisher: Universitas Sumatera Utara [en_US]
dc.subject: Word Complexity [en_US]
dc.subject: DeBERTa [en_US]
dc.subject: Natural Language Processing [en_US]
dc.subject: Complexity Prediction [en_US]
dc.subject: English Language [en_US]
dc.title: Implementasi Model DeBERTa untuk Prediksi Kompleksitas Kata Berbahasa Inggris [en_US]
dc.title.alternative: Implementation of the DeBERTa Model for English Word Complexity Prediction [en_US]
dc.type: Thesis [en_US]
dc.identifier.nim: NIM211402058
dc.identifier.nidn: NIDN0031087905
dc.identifier.nidn: NIDN0026028304
dc.identifier.kodeprodi: KODEPRODI59201#Teknologi Informasi
dc.description.pages: 91 Pages [en_US]
dc.description.type: Undergraduate Thesis (Skripsi Sarjana) [en_US]
dc.subject.sdgs: SDGs 4. Quality Education [en_US]

