Optimasi Parameter Tuning pada Model Regresi Logistik Lasso

Fadhilah, Syarifah

Optimasi Parameter Tuning pada Model Regresi Logistik Lasso

dc.contributor.advisor	Yanti, Maulida
dc.contributor.author	Fadhilah, Syarifah
dc.date.accessioned	2025-10-06T05:25:04Z
dc.date.available	2025-10-06T05:25:04Z
dc.date.issued	2025
dc.identifier.uri	https://repositori.usu.ac.id/handle/123456789/108980
dc.description.abstract	This study examines the optimization of the C parameter defined as the inverse of λ in the LASSO logistic regression model by comparing three cross-validation methods: KFold, Stratified KFold (SKF), and Repeated Stratified KFold (RSKF). Three ranges of C values (0.01–0.1, 0.0001–0.0251, and 0.1–316.2) were evaluated using log loss as the primary metric and F1-score as a secondary measure, while also considering the number of selected variables. The results show that the optimal C value leads to varying levels of variable selection. On the original dataset, C = 0.1 selected 8 out of 11 variables with an F1-score of 0.911 and a log loss of 0.303. For simulated data I (n = 150), C = 0.1 retained all 11 variables with no selection and an F1-score of 0.837. On simulated data II (n = 550),C = 0.0251 selected 11 of 15 variables (removing 4 noise variables) with an F1-score of 0.845. For simulated data III (n = 1500), the same C value selected 16 of 20 variables, eliminating 4 noise variables, with an F1-score of 0.884. The findings indicate that Stratified KFold provides the most stable results for imbalanced data. The smaller C range (0.0251–0.0001) was effective in filtering out noise variables, whereas larger ranges tended to retain more variables. A small difference between training and testing F1-scores (< 0.05) suggests stable models.	en_US
dc.language.iso	id	en_US
dc.publisher	Universitas Sumatera Utara	en_US
dc.subject	LASSO logistic regression	en_US
dc.subject	parameter C optimization	en_US
dc.subject	variable selection	en_US
dc.subject	cross-validation	en_US
dc.subject	log loss	en_US
dc.title	Optimasi Parameter Tuning pada Model Regresi Logistik Lasso	en_US
dc.title.alternative	Optimization of Parameter Tuning in the Lasso Logistic Regression Model	en_US
dc.type	Thesis	en_US
dc.identifier.nim	NIM210803115
dc.identifier.nidn	NIDN0024109003
dc.identifier.kodeprodi	KODEPRODI44201#Matematika
dc.description.pages	59 Pages	en_US
dc.description.type	Skripsi Sarjana	en_US
dc.subject.sdgs	SDGs 4. Quality Education	en_US

Files in this item

Name:: Optimasi Parameter Tuning pada ...
Size:: 512.8Kb
Format:: PDF
Description:: Cover

View/Open

Name:: Syarifah Fadhilah_Optimasi ...
Size:: 764.0Kb
Format:: PDF
Description:: Fulltext

View/Open

This item appears in the following Collection(s)

Undergraduate Theses [1496]
Skripsi Sarjana

Show simple item record