International Journal of Engineering Technology and Management Sciences

2023, Volume 7 Issue 3

BIOMEDICAL TEXT DOCUMENT CLASSIFICATION

AUTHOR(S)

M Vivaswanth Kashyap, Vaidya Saideepa, K Shivadar Reddy, Mrs.T.Ratnamala

DOI: https://doi.org/10.46647/ijetms.2023.v07i03.121

ABSTRACT
Information extraction, retrieval, and text categorization are only a few of the significant research fields covered by "bio medical text classification." This study examines many text categorization techniques utilised in practise, as well as their strengths and weaknesses, in order to improve knowledge of various information extraction opportunities in the field of data mining. We compiled a dataset with a focus on three categories: "Thyroid Cancer," "Lung Cancer," and "Colon Cancer." This paper presents an empirical study of a classifier. The investigation was carried out using biomedical literature benchmarks. Many metaheuristic algorithms are investigated, including genetic algorithms, particle swarm optimisation, firefly, cuckoo, and bat algorithms. In addition, the proposed multiple classifier system outperforms ensemble learning, ensemble pruning, and traditional classification methods. Based on the data, we forecast if it is Thyroid Cancer, Lung Cancer, or Colon Cancer using basic EDA, text preprocessing, and several models such as Logistic Regression, Decision Tree Classification, and Random Forest Classification.

Page No: 788 - 792

References:

    [1] Divina, Federico, Onan, Aytug – “Biomedical Text Categorization Based on Ensemble Pruning and Optimized Topic Modelling”, 2018
    [2] E. S. Chen, G. Hripcsak, H. Xu, M. Markatou, and C. Friedman, “Automated acquisition of disease–drug knowledge from biomedical and clinical documents: an initial study,” Journal of the American Medical Informatics Association, vol. 15, no. 1, pp. 87–98, 2008.
    [3] R. Rodriguez-Esteban, “Biomedical text mining and its applications,” PLoS Computational Biology, vol. 5, no. 12, Article ID e1000597, 2009.
    [4] Meenakshi Mishra, Jun Huan, Said Bleik, Min Song – “Biomedical Text Categorization with Concept Graph Representations Using a Controlled Vocabulary”, 2012.
    [5] Zerida, Nadia Lucas, Nadine Crémilleux, Bruno – “Exclusion-inclusion based text categorization of biomedical articles”, 2007
    [6] Minsuk Lee, Weiqing Wang and Hong Yu – “Exploring supervised and unsupervised methods to detect topics in biomedical text”, 2006
    [7] Kiritchenko, Svetlana – “Hierarchical text categorization and its application to bioinformatics”, 2005
    [8] Cyrille YetuYetu Kesiku, Andrea Chaves-Villota and Begonya Garcia-Zapirain – “Natural Language Processing Techniques for Text Classification of Biomedical Documents: A Systematic Review”, 2022
    [9] Manirupa Das, Juanxi Li, Eric Fosler-Lussier, Simon Lin, Steve Rust, Yungui Huang and Rajiv Ramnath – “Sequence-to-Set Semantic Tagging for Complex Query Reformulation and Automated Text Categorization in Biomedical IR using Self-Attention”, 2020
    [10] Man LAN, Chew Lim TAN, Jian SU, Hwee Boon LOW – “Text Representations for Text Categorization: A Case Study in Biomedical Domain”, 2007


How to Cite This Article:
Mr. D Krishna, Erukulla Laasya, A Sowmya Sri, T Ravinder Reddy, Akhil Sanjoy .BIOMEDICAL TEXT DOCUMENT CLASSIFICATION . ijetms;7(3):788-792. DOI: 10.46647/ijetms.2023.v07i03.121