Machine Learning in Data Analysis and Visualization in Healthcare

Authors

  • Yimin Wang

DOI:

https://doi.org/10.62051/21k3b546

Keywords:

Machine Learning; Healthcare; Visualization; Pandemi.

Abstract

Data visualization becomes an important tool for effective processing and analysis of large databases during epidemics, helping researchers, public health experts, and policymakers quickly identify trends, patterns, and anomalies by translating complex data into easy-to-understand graphs and charts. This study examines the application of Python and machine learning techniques to the analysis of outbreak data, specifically the methods used to process and analyze data during the COVID-19 outbreak. This paper concludes that data cleaning and feature extraction are important to ensure the accuracy and consistency of data. Furthermore, appropriate for the prediction of epidemic trends is the ARIMA model. Random Forest may therefore be used for case categorization as well as for SEIR model-based epidemic simulation. The huge possibilities of machine learning in enhancing the science and efficiency of public health decision-making are exposed in this work. At last, for the next pyhton applications, model interpretability, and data privacy protection become increasingly critical. This work offers a useful guide for further investigation and optimization of machine learning use in healthcare.

Downloads

Download data is not yet available.

References

[1] Q. Yang, et al., Research on COVID-19 based on Arima model—taking Hubei, China as an example to see the epidemic in Italy, Journal of Infection and Public Health, vol. 13, no. 10, pp. 1415 – 1418, Oct. 2020.

[2] Z. Yang, et al., Modified SEIR and AI prediction of the epidemics trend of COVID-19 in China under public health interventions, Journal of Thoracic Disease, vol. 12, no. 3, pp. 165–174, Mar. 2020.

[3] J. Wang, et al., A descriptive study of random forest algorithm for predicting COVID-19 patients’ outcome, PeerJ, vol. 8, 9 Sept. 2020.

[4] J.T. Wu, et al., Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: A modelling study, The Lancet, vol. 395, no. 10225, pp. 689 – 697, Feb. 2020.

[5] R.D. Joshi and C.K. Dhakal, predicting type 2 diabetes using logistic regression and machine learning approaches, International Journal of Environmental Research and Public Health, vol. 18, no. 14, p. 7346, 9 July 2021.

[6] A. Esteva, et al., Erratum: Corrigendum: Dermatologist-level classification of skin cancer with Deep Neural Networks, Nature, vol. 546, no. 7660, p. 686, June 2017.

[7] O. Ronneberger, et al., U-Net: Convolutional Networks for Biomedical Image Segmentation, Lecture Notes in Computer Science, pp. 234 – 241, 2015.

[8] X. Yi, et al., Generative Adversarial Network in medical imaging: A Review, Medical Image Analysis, vol. 58, p. 101552, Dec. 2019.

[9] Y. Qiu, et al., Novel Gene Signatures Predicting Breast Cancer Based on Random Forest and Artificial Neural Network, 30 Mar. 2022.

[10] T. Chen and C. Guestrin, XGBoost, Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 13 Aug. 2016.

[11] Q. Yang, Y. Liu, et al., Federated machine learning, ACM Transactions on Intelligent Systems and Technology, vol. 10, no. 2, pp. 1 – 19, 28 Jan. 2019.

Downloads

Published

25-11-2024

How to Cite

Wang, Y. (2024) “Machine Learning in Data Analysis and Visualization in Healthcare”, Transactions on Computer Science and Intelligent Systems Research, 7, pp. 356–362. doi:10.62051/21k3b546.