2023, Volume 7 Issue 2
Early Detection Of Breast Cancer Using Logistic Regression Method
AUTHOR(S)
Saheb karan, Abhik Roy Chowdhury, Amit Pal, Susmita Das, Mrs. Sulekha Das, Avijit Kumar Chaudhuri
DOI: https://doi.org/10.46647/ijetms.2023.v07i02.017
ABSTRACT
Breast cancer is the most frequently occurring cancer disease in women. It is reported almost 14% of cancers in Indian women are breast cancer. It becomes very crucial to predict breast cancer earlier to minimize the deaths. This research article helps to predict breast cancer earlier and reduce the immature deaths of women in India. In this paper, the authors have used the Logistic Regression method to classify the disease.
The authors simulate the results using logistic regression with 10-fold cross-validations and with a different train-test split of the dataset. The 10-fold cross validations display its potential with almost 94% performance in the research paper. With all features and 90-10 , 80-20,50-50, 66-34 splits, and 10-fold cross-validations the authors achieve 96% accuracy.
we have used different accuracy measures like accuracy, sensitivity, specificity, and kappa statistics to get the novelty of the model.
In this study, the authors use the Wisconsin (Diagnostic) Data Set for Breast Cancer, Created by Dr. William H. Wolberg, General Surgery Dept., University of Wisconsin, Clinical Science Centre, Madison, WI 53792 wolberg@eagle.surgery.wisc.edu available at the UCI ML Repository website.
Page No: 133 - 142
References:
[1] M. U. Sarwar, M. K. Hanif, R. Talib, A. Mobeen, and M. Aslam, “A survey of big data analytics in healthcare,” Int. J. Adv. Comput. Sci. Appl., vol. 8, pp. 355-359, 2017.
[2] S. Maldonado, J. López, A. Jimenez -Molina, and H. Lira, “Simultaneous feature selection and heterogeneity control for SVM classification: An application to mental workload assessment,” Expert Syst. Appl., vol. 143, pp. 112988, 2020.
[3] T. Sridevi and A. Murugan, “A novel feature selection method for effective breast cancer diagnosis and prognosis,” Int. J. Comput. Appl., vol. 88, 2014.
[4] L. Breiman, “Random forests,” Mach. Learn, vol. 45, pp. 5-32, 2001.
[5] G. Cavallaro, M. Riedel, M. Richerzhagen, J. A. Benediktsson, and A. Plaza, “On understanding big data impacts in remotely sensed image classification using support vector machine methods,” IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens., vol. 8, pp. 4634-4646, 2015.
[6] D. J. Hand, H. Mannila, and P. Smyth, Principles of Data Mining: Adaptive Computation and Machine Learning. ISBN: 026208290X, 2001.
[7] H. L. Chen, B. Yang, J. Liu, and D. Y. Liu, “A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis,” Expert Syst. Appl., vol. 38, pp. 9014-9022, 2011.
[8] B. Yuan and X. Ma, “Sampling+ reweighting: Boosting the performance of AdaBoost on imbalanced datasets,” in The 2012 international joint conference on neural networks (IJCNN), IEEE, June 2012, pp. 1-6.
[9] B. Yuan and X. Ma, “Sampling+ reweighting: Boosting the performance of AdaBoost on imbalanced datasets,” in The 2012 international joint conference on neural networks (IJCNN), IEEE, June 2012, pp. 1-6.
[10] W. Kim, K. S. Kim, J. E. Lee, D. Y. Noh, S. W. Kim, Y. S. Jung, and R. W. Park, “Development of novel breast cancer recurrence prediction model using support vector machine,” J. Breast Canc., vol. 15, pp. 230-238, 2012.
[11] M. U. Sarwar, M. K. Hanif, R. Talib, A. Mobeen, and M. Aslam, “A survey of big data analytics in healthcare,” Int. J. Adv. Comput. Sci. Appl., vol. 8, pp. 355-359, 2017.
[12] L. Breiman, “Random forests,” Mach. Learn, vol. 45, pp. 5-32, 2001.
[13] P. Hamsagayathri and P. Sampath, “Performance analysis of breast cancer classification using decision tree classifiers,” Int. J. Curr. Pharm. Res., vol. 9, pp. 19-25, 2017.
[14] B. Xue, M. Zhang, W. N. Browne, and X. Yao, “A survey on evolutionary computation approaches to feature selection,” IEEE Trans. Evol. Comput., vol. 20, pp. 606-626, 2015.
[15] B. Bhasuran, G. Murugesan, S. Abdulkadhar, and J. Natarajan, “Stacked ensemble combined with fuzzy matching for biomedical named entity recognition of diseases,” J Biomed. Informat., vol. 64, pp. 1-9, 2016.
[16] A. K. Chaudhuri, D. Sinha, and K. S. Thyagaraj, “Identification of the recurrence of breast cancer by discriminant analysis,” in Emerging technologies in data mining and information security, Singapore: Springer, 2019, pp. 519-532.
[17] S. L. Ang, H. C. Ong, and H. C. Low, “Classification Using the General Bayesian Network,” Pertanika J. Sci. Technol., vol. 24, 2016.
[18] S. Kumari and M. Arumugam, “Application of bio-inspired krill herd algorithm for breast cancer classification and diagnosis, ”Indian J. Sci. Technol., vol. 8, pp. 30, 2015.
[19] Y. Freund and R. E. Schapire, “Experiments with a new boosting algorithm,” in ICML, vol. 96, July 1996, pp. 148-156.
[20] R. Sikora, “A modified stacking ensemble machine learning algorithm using genetic algorithms,” in Handbook of research on organizational transformations through big data analytics, IGI Global, 2015, pp. 43-53.
[21] S. K. Trivedi and S. Dey, “A study of ensemble based evolutionary classifiers for detecting unsolicited emails, in October 2014, pp. 46-51,in Proceedings of the 2014 conference on research in adaptive and convergent systems.
[22] In 1992, pp. 196-202,F. Wilcoxon, “Individual comparisons by ranking methods,” in Breakthroughs in statistics, New York: Springer.
[23] K. Sivakami and N. Saraswathi, “Mining big data: breast cancer prediction using DT-SVM hybrid model,” Int. J. Sci. Eng. Appl. Sci., vol. 1, pp. 418-429, 2015.
[24] A. Keleş, A. Keleş, and U. Yavuz, “Expert system based on neuro-fuzzy rules for diagnosis breast cancer,” Expert Syst. Appl., vol. 38, pp. 5719-5726, 2011
[25] W. Kim, K. S. Kim, J. E. Lee, D. Y. Noh, S. W. Kim, Y. S. Jung, and R. W. Park, “Development of novel breast cancer recurrence prediction model using support vector machine,” J. Breast Canc., vol. 15, pp. 230-238, 2012.
[26] S. Kharya, S. Agrawal, and S. Soni, “Naive Bayes classifiers: a probabilistic detection model for breast cancer,” Int. J. Comput. Appl., vol. 92, pp. 0975-8887, 2014.
[27] S. Kharya, S. Agrawal, and S. Soni, “Naive Bayes classifiers: a probabilistic detection model for breast cancer,” Int. J. Comput. Appl., vol. 92, pp. 0975-8887, 2014
[28] G. R. Kumar, G. A. Ramachandra, and K. Nagamani, “An efficient prediction of breast cancer data using data mining techniques,” Int. J. Innov. Eng. Technol., vol. 2, pp. 139, 2013.
How to Cite This Article:
Saheb karan, Abhik Roy Chowdhury, Amit Pal, Susmita Das, Mrs. Sulekha Das, Avijit Kumar Chaudhuri
. Early Detection Of Breast Cancer Using Logistic Regression Method
. ijetms;7(2):133-142. DOI: 10.46647/ijetms.2023.v07i02.017