Prediksi Faktor Risiko Gangguan Tidur Menggunakan Pendekatan Machine Learning Logistic Regression dan Gradient Boosting
##plugins.themes.academic_pro.article.main##
Abstract
Sleep disorders are one of the major public health issues with broad implications for quality of life, productivity, and chronic disease risks. This study aims to predict risk factors of sleep disorders using a machine learning approach with survey data from the National Sleep Foundation. The research process involved data Cleaning, transformation, normalization, and splitting into training (80%) and testing (20%) sets. Two algorithms were applied, Logistic Regression and Gradient Boosting, and their performance was evaluated using Accuracy, Precision, Recall, and F1-score metrics. SHAP analysis was also employed to assess variable contributions to model predictions. The results indicate that Gradient Boosting outperformed Logistic Regression, achieving perfect Accuracy and F1-score (1.00), while Logistic Regression only reached 0.70. SHAP analysis revealed that sleep duration and quality are the most influential factors, followed by caffeine consumption and age. Therefore, Gradient Boosting not only provides accurate classification but also comprehensive insights into key determinants of sleep disorders, serving as a foundation for more effective health interventions.
##plugins.themes.academic_pro.article.details##

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
References
- Daftar Pustaka
- Mohammad Rasel Mahmud, Al Shahriar Uddin Khondakar Pranta, Anamul Haque Sakib, Abdullah Al Sakib, and Md Ismail Hossain Siddiqui, “Robust feature selection for improved sleep stage classification,” Int. J. Sci. Res. Arch., vol. 15, no. 1, pp. 1790–1797, 2025, doi: 10.30574/ijsra.2025.15.1.1160.
- Md Ismail Hossain Siddiqui, Anamul Haque Sakib, Sanjida Akter, Jesika Debnath, and Mohammad Rasel Mahmud, “Comparative analysis of traditional machine learning Vs deep learning for sleep stage classification,” Int. J. Sci. Res. Arch., vol. 15, no. 1, pp. 1778–1789, 2025, doi: 10.30574/ijsra.2025.15.1.1159.
- G. Long, W. Qingyan, T. Jian, W. Duo, W. Xiaotian, and D. Hongfei, “?????? Advances in Aeronautical Science and Engineering,” Adv. Aeronaut. Sci. Eng., vol. 14, no. 2, pp. 178–190, 2023.
- M. Jain and A. Srihari, “Comparison of Machine learning Algorithm in Intrusion Detection Systems: A Review Using Binary Logistic Regression,” Int. J. Comput. Sci. Mob. Comput., vol. 13, no. 10, pp. 45–53, 2024, doi: 10.47760/ijcsmc.2024.v13i10.005.
- C. Zhang, Y. Zhang, J. Pu, Z. Liu, Z. Wang, and L. Wang, “An hourly solar radiation prediction model using eXtreme Gradient Boosting algorithm with the effect of fog-haze,” Energy Built Environ., vol. 6, no. 1, pp. 18–26, 2025, doi: 10.1016/j.enbenv.2023.08.001.
- J. Crossa et al., “Machine learning algorithms translate big data into predictive breeding Accuracy,” Trends Plant Sci., vol. 30, no. 2, pp. 167–184, 2025, doi: 10.1016/j.tplants.2024.09.011.
- R. Vadiasetty et al., “Artificial Intelligence-Based ML Techniques and Big Data Analytics to Precision Diabetes Diagnosis in Healthcare Systems,” vol. 2023, no. 3, pp. 1–8, 2023, doi: 10.71010/2023/JCETAI-e103-1.
- J. P. Ntayagabiri, Y. Bentaleb, J. Ndikumagenge, and H. El Makhtoum, “A Comparative Analysis of Supervised Machine learning Algorithms for IoT Attack Detection and Classification,” J. Comput. Theor. Appl., vol. 2, no. 3, pp. 395–409, 2025, doi: 10.62411/jcta.11901.
- S. M. K. Alam, P. Li, M. Rahman, M. Fida, and V. Elumalai, “Key factors affecting groundwater nitrate levels in the Yinchuan Region, Northwest China: Research using the eXtreme Gradient Boosting (XGBoost) model with the SHAPley Additive Explanations (SHAP) method,” Environ. Pollut., vol. 364, no. P1, p. 125336, 2025, doi: 10.1016/j.envpol.2024.125336.