International Journal of Engineering Technology and Management Sciences

2023, Volume 7 Issue 3

RECOGNIZING TAMIL CHARACTERS IN PALM LEAF MANUSCRIPTS (DEEP LEARNING)

AUTHOR(S)

Ms.J.Juslin Sega, Dr.J.Shiny Duela, Ms.Raghavi M

DOI: https://doi.org/10.46647/ijetms.2023.v07i03.038

ABSTRACT
Tamil is an ancient language with a vast body of literature written on palm leaves and other materials. Palm leaf manuscripts have served as a versatile medium for recording information on medicine, literature, theatre, and other subjects. Despite the need for digitization and transcription, recognizing cursive characters in palm leaf manuscripts remains a challenging task. This study introduces a novel Convolutional Neural Network (CNN) technique that learns the characteristics of palm leaf characters during the training phase so that the network can classify them accurately. The input image is first preprocessed with morphological operations to remove noise. Connected component analysis, an image processing technique that identifies and labels the individual connected regions (components) of a binary image, is then used to segment the palm leaf characters, with feature processing that includes text line spacing, spacing without an obstacle, and spacing with an obstacle. Finally, the extracted cursive characters are fed to the CNN for classification. Experiments on a collected set of cursive Tamil palm leaf manuscripts compare the performance of the proposed CNN against existing deep learning techniques in terms of accuracy, precision, recall, and related metrics.
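
The preprocessing and segmentation steps named in the abstract can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the 3x3 structuring element, the 8-connectivity, the toy image, and the function names (`erode`, `dilate`, `opening`, `label_components`) are all assumptions made for illustration. A morphological opening removes isolated noise pixels, and connected component labeling then returns one bounding box per remaining region, which is the kind of crop a CNN classifier would receive.

```python
from collections import deque

# Offsets of a 3x3 neighbourhood (assumed structuring element / 8-connectivity).
OFFSETS = [(dy, dx) for dy in (-1, 0, 1) for dx in (-1, 0, 1)]


def _morph(img, combine):
    """Apply a 3x3 binary operation; pixels outside the image are background."""
    h, w = len(img), len(img[0])
    return [[int(combine(0 <= y + dy < h and 0 <= x + dx < w and img[y + dy][x + dx]
                         for dy, dx in OFFSETS))
             for x in range(w)] for y in range(h)]


def erode(img):
    return _morph(img, all)   # keep a pixel only if its whole neighbourhood is set


def dilate(img):
    return _morph(img, any)   # set a pixel if any neighbour is set


def opening(img):
    """Erosion followed by dilation: removes specks smaller than the 3x3 element."""
    return dilate(erode(img))


def label_components(img):
    """Label 8-connected foreground regions with a breadth-first search.

    Returns (labels, boxes), where each box is (top, left, bottom, right).
    """
    h, w = len(img), len(img[0])
    labels = [[0] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if not img[y][x] or labels[y][x]:
                continue
            lab = len(boxes) + 1
            labels[y][x] = lab
            queue = deque([(y, x)])
            top, left, bottom, right = y, x, y, x
            while queue:
                cy, cx = queue.popleft()
                top, bottom = min(top, cy), max(bottom, cy)
                left, right = min(left, cx), max(right, cx)
                for dy, dx in OFFSETS:
                    ny, nx = cy + dy, cx + dx
                    if 0 <= ny < h and 0 <= nx < w and img[ny][nx] and not labels[ny][nx]:
                        labels[ny][nx] = lab
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return labels, boxes


# Toy binary image: two 3x3 "characters" and one isolated noise pixel.
IMAGE = [
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
]

_, raw_boxes = label_components(IMAGE)
_, clean_boxes = label_components(opening(IMAGE))
print(len(raw_boxes), "components before opening,", len(clean_boxes), "after")
```

On real manuscript scans a library routine would normally replace these loops, and the structuring element would need to be chosen so that thin strokes survive the opening.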

Page No: 293 - 298

How to Cite This Article:
Ms.J.Juslin Sega, Dr.J.Shiny Duela, Ms.Raghavi M. RECOGNIZING TAMIL CHARACTERS IN PALM LEAF MANUSCRIPTS (DEEP LEARNING). ijetms;7(3):293-298. DOI: 10.46647/ijetms.2023.v07i03.038