| dc.contributor.advisor | Sawaluddin, Sawaluddin | |
| dc.contributor.author | Hutabarat, David Kevin Handel | |
| dc.date.accessioned | 2026-01-06T02:28:54Z | |
| dc.date.available | 2026-01-06T02:28:54Z | |
| dc.date.issued | 2026 | |
| dc.identifier.uri | https://repositori.usu.ac.id/handle/123456789/111753 | |
| dc.description.abstract | This study analyzes and compares the performance of Random Forest and Elastic Net Logistic Regression (SAGA solver) for classifying patient discharge status at BPJS Kesehatan Primary Care Facilities (FKTP). The dataset is large-scale, contains exclusively nominal predictors, and exhibits class imbalance (approximately 64.9% majority class). The experimental design employs an 80%/20% train–test split, one-hot encoding for preprocessing, and class balancing on the training data via random undersampling. Hyperparameter tuning is conducted using a staged coarse-to-fine search with local-optimum convergence criteria (improvement threshold ε = 10^(−6) and patience = 10), followed by 10-fold cross-validation for internal evaluation and final assessment on the test set. The primary evaluation metrics are F1-Score, Precision–Recall AUC (PR AUC), and Brier Score. On the test set, both methods achieve identical F1-Scores and nearly identical PR-AUC values. Random Forest and Elastic Net Logistic Regression both attain F1 = 0.996679, with PR-AUC = 0.999931 for Random Forest and 0.999927 for Elastic Net Logistic Regression. A more interpretable difference is observed in probability calibration, where Elastic Net Logistic Regression yields a lower Brier Score (0.002016) compared to Random Forest (0.002706). These results indicate that while both models lie on the same performance plateau in terms of discrimination and ranking, Elastic Net Logistic Regression provides better-calibrated probability estimates. | en_US |
| dc.language.iso | id | en_US |
| dc.publisher | Universitas Sumatera Utara | en_US |
| dc.subject | Brier Score | en_US |
| dc.subject | F1 Score | en_US |
| dc.subject | Penalized Logistic Regression | en_US |
| dc.subject | PR-AUC | en_US |
| dc.subject | Random Forest | en_US |
| dc.title | Studi Komparatif Random Forest dan Regresi Logistik Elastic Net pada Klasifikasi Status Pulang FKTP BPJS Kesehatan: Kajian F1-Score, PR-AUC, dan Brier Score | en_US |
| dc.title.alternative | Comparative Study of Random Forest and Elastic Net Logistic Regression in Classifying Patient Discharge Status at BPJS Kesehatan Primary Healthcare Facilities: An Evaluation Based on F1-Score, PR AUC, and Brier Score | en_US |
| dc.type | Thesis | en_US |
| dc.identifier.nim | NIM190803100 | |
| dc.identifier.nidn | NIDN0031125982 | |
| dc.identifier.kodeprodi | KODEPRODI44201#Matematika | |
| dc.description.pages | 144 Pages | en_US |
| dc.description.type | Skripsi Sarjana | en_US |
| dc.subject.sdgs | SDGs 4. Quality Education | en_US |