Multi-factors correlation to diabetes using Machine Learning: findings from BRFSS

Authors

  • Shiying Wu

DOI:

https://doi.org/10.62051/g4nz0017

Keywords:

Machine Learning; Data Mining; Diabetes.

Abstract

The escalating prevalence of diabetes has emerged as a burgeoning national concern, necessitating comprehensive investigation and intervention. Leveraging data extracted from the 2021 Behavioral Risk Factor Surveillance System (BRFSS), this study harnesses the power of data mining techniques to discern salient variables within expansive datasets. Subsequently, a series of machine learning models, comprising Random Forest, Decision Tree, Lasso Regression, and Neural Network, were constructed, and evaluated. The principal objective of this research is to facilitate the identification of the optimal predictive model for diabetes. By scrutinizing and contrasting the performance of these diverse models, this study aspires to contribute valuable insights into the field of diabetes prediction, potentially aiding in the development of more accurate and effective diagnostic tools. Consequently, this research and to make a meaningful stride towards mitigating the adverse impacts of the burgeoning diabetes epidemic and underscores the pivotal role of data-driven analytics and machine learning in the realm of public health research and policy formulation.

Downloads

Download data is not yet available.

References

World Health Organization 2023 Diabetes. https://www.who.int/news-room/fact- sheets/detail/diabetes.

Diabetes UK 2023 Complications of diabetes https://www.diabetes.org.uk/guide-to-diabetes/complications.

American Diabetes Association 2018 Economic costs of diabetes in the US in 2017 Diabetes Care pp 917 - 928.

Centers for Disease Control and Prevention Prevalence of Both Diagnosed and Undiagnosed Diabetes https://www.cdc.gov/diabetes/data/statistics-report/diagnosed-undiagnosed-diabetes.html.

Schauer, Gillian L, Abigail H, Lloyd M and Mark D 2013 Health professional advice for smoking and weight in adults with and without diabetes: findings from BRFSS Journal of Behavioral Medicine, vol. 36, no. 1 pp. 10+ http: //dx.doi.org/10.1007/s10865 - 011 - 9386 - 9.

Tran P, Tran L and Tran L 2019 Impact of rurality on diabetes screening in the US. BMC Public Health 19(1) pp 1190 – 1190. https: //doi.org/10.1186/s12889 - 019 - 7491 - 9.

Tung L, Baig A, Huang S, Laiteerapong N, Chua K 2017 Racial and Ethnic Disparities in Diabetes Screening Between Asian Americans and Other Adults: BRFSS 2012–2014 J GEN INTERN MED 32, pp 423 – 429 https://doi-org.ezproxy.bu.edu/10.1007/s11606-016-3913-x.

Yoon S, Taha B and Bakken S 2014 Using a data mining approach to discover behavior correlates of chronic disease: a case study of depression Stud Health Technol Inform 201: 71 - 8.

Lu H, Uddin S, Hajati F, Moni MA and Khushi M 2022 A patient network-based machine learning model for disease prediction: the case of type 2 diabetes mellitus Appl Intell https: //doi.org/10.1007/s10489 - 021 - 02533 - w.

Xie Z, Nikolayeva O, Luo J and Li D 2019 Building risk prediction models for type 2 diabetes using Machine Learning Techniques Preventing Chronic Disease 16 https: //doi.org/10.5888/pcd16.190109.

Fayyad U, Piatetski-Shapiro G, Smyth P and Uthurusamy R 1998 Advances in Knowledge Discovery and Data Mining Technometrics, 40 (1), p. 83. https: //doi.org/10.2307/1271414.

Mariwan S 2023 Diabetes type 2 classification using machine learning algorithms with up-sampling technique Journal of Electrical Systems and Information Technology, vol. 10 no. 1 http://dx.doi.org/10.1186/s43067-023-00074 - 5.

Brownlee J 2020 Imbalanced Classification with Python: Better Metrics, Balance Skewed Classes, Cost- Sensitive Learning. Machine Learning Mastery.

Downloads

Published

24-03-2024

How to Cite

Wu, S. (2024). Multi-factors correlation to diabetes using Machine Learning: findings from BRFSS. Transactions on Materials, Biotechnology and Life Sciences, 3, 143-149. https://doi.org/10.62051/g4nz0017