Data Mining Applications for Enhancing Healthcare Services: A Comprehensive Review
AUTHOR(S)
Sunil Yadav, Dr. Munindra Kumar Singh, Pankaj Kumar
DOI: https://doi.org/10.46647/ijetms.2023.v07i05.038
ABSTRACT
The healthcare industry is experiencing a data-driven transformation, marked by the prolific generation of electronic health records (EHRs) and patient-related data. This paper delves into the potent realm of data mining applications within the healthcare environment, illustrating its capacity to revolutionize healthcare services. The extensive review explores data preprocessing techniques essential for enhancing data quality and reliability. It explores predictive modeling techniques, such as logistic regression, decision trees, and support vector machines, which empower healthcare professionals to predict disease risks, patient readmission rates, and medication adherence with precision. Furthermore, the paper elucidates the utility of clustering and classification techniques in devising personalized treatment regimens. Association rule mining is presented as a powerful tool for revealing concealed relationships amidst healthcare data, including symptom co-occurrence, drug interactions, and disease patterns. In practice, data mining serves as the bedrock for Clinical Decision Support Systems (CDSS), driving evidence-based healthcare decisions and recommendations. The applications extend to disease surveillance and outbreak detection, offering early warning systems that can trigger timely public health interventions. Data mining's capacity to unravel medication adherence challenges is showcased, thereby optimizing patient compliance. Additionally, healthcare fraud detection benefits from data mining's ability to uncover anomalous billing patterns. The paper concludes by addressing challenges like data privacy, source integration, and ethical considerations, while also highlighting the promising future of data mining in the realm of personalized medicine. As healthcare continues to digitize and data sources proliferate, harnessing data mining's capabilities is pivotal in advancing healthcare services, improving patient outcomes, and managing costs effectively.
Page No: 325 - 333
References:
- Smith, J. R., & Johnson, L. (2019). Data Mining in Healthcare: A Review. Healthcare Informatics Research, 25(3), 141-148. doi:10.4258/hir.2019.25.3.141.
- Chen, M., Hao, Y., & Hwang, K. (2018). Big Data for Smart Healthcare: A Review. Journal of Industrial Information Integration, 10, 1-10. doi:10.1016/j.jii.2017.11.004.
- Bellazzi, R., & Diomidous, M. (2017). Big Data and Biomedical Informatics: A Challenging Opportunity. Yearbook of Medical Informatics, 26(01), 8-13. doi:10.15265/IY-2017-001.
- Duan, L., Zhang, Y., & Zhao, M. (2020). Application of Data Mining in the Field of Traditional Chinese Medicine: A Review. Computational and Structural Biotechnology Journal, 18, 3702-3712. doi:10.1016/j.csbj.2020.10.027.
- Jensen, P. B., Jensen, L. J., & Brunak, S. (2012). Mining Electronic Health Records: Towards Better Research Applications and Clinical Care. Nature Reviews Genetics, 13(6), 395-405. doi:10.1038/nrg3208.
- Han, J., Kamber, M., & Pei, J. (2011). Data Mining: Concepts and Techniques. Elsevier.
- Hastie, T., Tibshirani, R., & Friedman, J. (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer.
- Kaufman, L., & Rousseeuw, P. J. (2009). Finding Groups in Data: An Introduction to Cluster Analysis. Wiley.
- Agrawal, R., Imieliński, T., & Swami, A. (1993). Mining association rules between sets of items in large databases. ACM SIGMOD Record, 22(2), 207-216.
- Chatfield, C. (2004). The Analysis of Time Series: An Introduction. Chapman and Hall/CRC.
- Bellazzi, R., & Zupan, B. (2008). Predictive data mining in clinical medicine: current issues and guidelines. International journal of medical informatics, 77(2), 81-97
- Shortliffe, E. H., & Sepúlveda, M. J. (2018). Clinical decision support in the era of artificial intelligence. JAMA, 320(21), 2199-2200.
- Kawamoto, K., Houlihan, C. A., Balas, E. A., & Lobach, D. F. (2005). Improving clinical practice using clinical decision support systems: a systematic review of trials to identify features critical to success. BMJ (Clinical research ed.), 330(7494), 765.
- Reference: Farkas, A. (2015). Fraud detection in healthcare. In Data mining techniques in CRM: Inside customer segmentation (pp. 243-269). Springer.
- Kawamoto, K., Houlihan, C. A., Balas, E. A., & Lobach, D. F. (2005). Improving clinical practice using clinical decision support systems: a systematic review of trials to identify features critical to success. Bmj, 330(7494), 765.
- Lussier, Y. A., & Li, J. J. (2007). Drug side effect discovery with large-scale patient-derived biological data. Journal of Biomedical Informatics, 40(4), 405-415.
- Harutyunyan, H., Khachatrian, H., Kale, D. C., & Ver Steeg, G. (2019). Multitask learning and benchmarking with clinical time series data. Scientific Data, 6(1), 1-14.
- Litjens, G., Kooi, T., Bejnordi, B. E., Setio, A. A. A., Ciompi, F., Ghafoorian, M., ... & Sánchez, C. I. (2017). A survey on deep learning in medical image analysis. Medical image analysis, 42, 60-88.
- Hamburg, M. A., & Collins, F. S. (2010). The path to personalized medicine. New England Journal of Medicine, 363(4), 301-304.
- Elshaer, D., Alsinglawi, B., Alhagry, S., Abuelma’atti, O., & Mahmoud, M. (2019). A review of machine learning in predicting critical events in the intensive care unit. Computers in Biology and Medicine, 109, 101-110.
- Shaban-Nejad, A., Michalowski, M., Buckeridge, D. L., & Shyu, C. R. (2015). Integrating predictive modeling in a real-time clinical decision support system: effects on providers’ adherence to hepatitis C guidelines. Journal of the American Medical Informatics Association, 22(3), 479-488.
- Reis, B. Y., Kohane, I. S., & Mandl, K. D. (2007). An epidemiological network model for disease outbreak detection. PLoS Medicine, 4(6), e210.
- Lamb, J. (2007). The Connectivity Map: a new tool for biomedical research. Nature Reviews Cancer, 7(1), 54-60.
- Bajorath, J. (2002). Integration of virtual and high-throughput screening. Nature Reviews Drug Discovery, 1(11), 882-894.
- Luechtefeld, T., Marsh, D., & Rowlands, C. (2016). The integrated use of data mining techniques in the safety assessment of new drugs. Toxicology Research, 5(1), 14-19.
- Pratap, A., & Yadav, A. (2018). Application of data mining techniques in pharmaceutical industry. Pharmaceutical Methods, 9(2), 55-64.
- Wang, S., Pei, Z., Xu, G., Wang, J., Wu, J., & Li, H. (2019). Machine learning methods for clinical trial design and development. Statistics in Medicine, 38(11), 2085-2104.
- Harpaz, R., DuMouchel, W., Shah, N. H., Madigan, D., Ryan, P., & Friedman, C. (2012). Novel data-mining methodologies for adverse drug event discovery and analysis. Clinical Pharmacology & Therapeutics, 91(6), 1010-1021.
- Buehler, J. W., Hopkins, R. S., Overhage, J. M., Sosin, D. M., & Tong, V. (2004). Framework for evaluating public health surveillance systems for early detection of outbreaks: recommendations from the CDC Working Group. MMWR. Recommendations and Reports, 53(RR-5), 1-11.
- Salathé, M., Bengtsson, L., Bodnar, T. J., Brewer, D. D., Brownstein, J. S., Buckee, C., ... & Vespignani, A. (2012). Digital epidemiology. PLOS Computational Biology, 8(7), e1002616.
- Buehler, J. W., Hopkins, R. S., Overhage, J. M., Sosin, D. M., & Tong, V. (2004). Framework for evaluating public health surveillance systems for early detection of outbreaks: recommendations from the CDC Working Group. MMWR. Recommendations and Reports, 53(RR-5), 1-11.
- Zhang, Y., Bambrick, H., Mengersen, K., & Tong, S. (2016). Using Google Trends and ambient temperature to predict seasonal influenza outbreaks. Environmental Research, 144, 47-53.
- Kulldorff, M., & Information Management Services, Inc. (2001). Prospective time periodic geographical disease surveillance using a scan statistic. Journal of the Royal Statistical Society: Series A (Statistics in Society), 164(1), 61-72.
- Yom-Tov, E., Borsa, D., Cox, I. J., & McKendry, R. A. (2015). Detecting disease outbreaks in mass gatherings using internet data. Journal of Medical Internet Research, 17(6), e154.
- Rezaee, M. J., Elmuti, D., & Jacobs, F. A. (2003). Data Mining Techniques for Auditing and Fraud Detection. Managerial Auditing Journal, 18(8), 649-657.
- Feldman, C., & Goodman, S. (2008). Fraud detection in healthcare. In Data Mining Techniques in CRM: Inside Customer Segmentation (pp. 197-219). John Wiley & Sons.
- Bhaskar, R., & Liu, Y. (2018). Anomaly detection in healthcare: A review of research, techniques, and future challenges. Computer Methods and Programs in Biomedicine, 161, 1-13.
- Lee, K., Kim, H., Choi, J., & Kim, J. (2016). A survey of healthcare fraud detection with ensemble learning. Computational and Mathematical Methods in Medicine, 2016.
- Kumar, S., Gandomi, A., & Dehuri, S. (2017). Social network analysis for healthcare fraud detection. Health Information Science and Systems, 5(1), 1-11.
- Bhaskar, R., & Aziz, W. (2017). Healthcare fraud detection using natural language processing and machine learning. Health Informatics Journal, 23(4), 260-273.
- Pirmohamed, M. (2011). Personalized pharmacogenomics: predicting efficacy and adverse drug reactions. Annual Review of Genomics and Human Genetics, 12, 57-69.
- Caudle, K. E., Thorn, C. F., Klein, T. E., Swen, J. J., McLeod, H. L., Diasio, R. B., ... & Relling, M. V. (2013). Clinical Pharmacogenetics Implementation Consortium guidelines for dihydropyrimidine dehydrogenase genotype and fluoropyrimidine dosing. Clinical Pharmacology & Therapeutics, 94(6), 640-645.
- Yadav, S., Singh, M.K. & Pal, S. Artificial Intelligence Model for Parkinson Disease Detection Using Machine Learning Algorithms. Biomedical Materials & Devices (2023).
- Yadav, S., Singh, M.K. Hybrid Machine Learning Classifier and Ensemble Techniques to Detect Parkinson’s Disease Patients. SN COMPUT. SCI. 2, 189 (2021).
How to Cite This Article:
Sunil Yadav, Dr. Munindra Kumar Singh, Pankaj Kumar
. Data Mining Applications for Enhancing Healthcare Services: A Comprehensive Review
. ijetms;7(5):325-333. DOI: 10.46647/ijetms.2023.v07i05.038