dc.contributor.advisor | Harumy, T Henny Febriana | |
dc.contributor.author | Simamora, Edward Bob | |
dc.date.accessioned | 2025-02-17T01:56:54Z | |
dc.date.available | 2025-02-17T01:56:54Z | |
dc.date.issued | 2024 | |
dc.identifier.uri | https://repositori.usu.ac.id/handle/123456789/101315 | |
dc.description.abstract | Diabetes is one of the significant global health problems with widespread impacts on
individual quality of life and a significant economic burden on healthcare systems. In
an effort to improve early diagnosis and understanding of factors influencing this
disease, the use of data analysis techniques has become increasingly important. One
approach used is the application of logistic regression algorithms, which provide
information on the probability of diabetes occurrence based on independent variables.
In this study, the use of Information Gain-based feature selection methods is explored
to enhance the performance of logistic regression algorithms in identifying risk factors
for diabetes. Information Gain method is employed to evaluate the relevance of
variables to the target class, i.e., the presence or absence of diabetes. In the
experimental process, a dataset consisting of clinical attributes such as age, body mass
index (BMI), blood pressure, and several other biochemical parameters is used. The
experimental results indicate that the use of Information Gain method for feature
selection can improve the performance of logistic regression models in predicting the
presence of diabetes. By reducing the dimensionality of irrelevant attributes, the
resulting model tends to have higher accuracy and can identify more significant risk
factors. This highlights the potential of Information Gain-based feature selection
methods in enhancing the efficiency and effectiveness of predictive analysis in diabetes. | en_US |
dc.language.iso | id | en_US |
dc.publisher | Universitas Sumatera Utara | en_US |
dc.subject | Feature Selection | en_US |
dc.subject | Classification | en_US |
dc.subject | Information Gain Method | en_US |
dc.subject | Logistic Regression Algorithm | en_US |
dc.subject | Diabetes | en_US |
dc.subject | Data mining | en_US |
dc.subject | Machine Learning | en_US |
dc.subject | Confusion Matrix | en_US |
dc.subject | Python Programming | en_US |
dc.title | Seleksi Fitur dengan Menggunakan Metode Information Gain pada Algoritma Logistic Regression pada Penyakit Diabetes | en_US |
dc.title.alternative | Feature Selection Using Information Gain Method in Logistic Regression Algorithm for Diabetes Disease | en_US |
dc.type | Thesis | en_US |
dc.identifier.nim | NIM171401141 | |
dc.identifier.nidn | NIDN0119028802 | |
dc.identifier.nidn | NIDN0101058801 | |
dc.identifier.kodeprodi | KODEPRODI55201#Ilmu Komputer | |
dc.description.pages | 61 Pages | en_US |
dc.description.type | Skripsi Sarjana | en_US |
dc.subject.sdgs | SDGs 4. Quality Education | en_US |