International Journal of Engineering Technology and Management Sciences

2023, Volume 7 Issue 1

Speech Based Emotion Recognition System

AUTHOR(S)

Sri Murugharaj B R, Shakthy B, Sabari L, Dr Kamaraj K

DOI: https://doi.org/10.46647/ijetms.2023.v07i01.050

ABSTRACT
Emotion recognition from speech signals is an important yet challenging component of human-computer interaction (HCI). In the literature on speech emotion recognition (SER), many well-known speech analysis and classification techniques have been used to extract emotions from signals. Deep learning techniques have recently been proposed as an alternative to traditional ones for SER. We develop an SER system based on different classifiers and different feature extraction methods. Features extracted from the speech signals are used to train the classifiers. Feature selection (FS) is applied to find the most relevant feature subset. Several machine learning paradigms are used for the emotion classification task. A recurrent neural network (RNN) classifier is first used to classify seven emotions. Its results are then compared with those of support vector machines (SVM) and multivariate linear regression (MLR), techniques widely used in emotion recognition from spoken audio signals. The Berlin and Spanish databases serve as the experimental data sets. The study shows that, on the Berlin database, the classifiers reach an accuracy of 83% when speaker normalization (SN) and feature selection are applied to the features. The RNN classifier applied with neither SN nor FS achieves a high accuracy of 94%.
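
The abstract does not say how speaker normalization is computed. One common choice, assumed here purely for illustration, is to z-score every feature dimension within each speaker, so that differences in a speaker's baseline (for example pitch range) are removed before classification. The sketch below shows that idea on toy data; the function name `speaker_normalize`, the speaker ids, and the feature values are hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from statistics import mean, pstdev


def speaker_normalize(features, speakers):
    """Z-score each feature dimension separately within each speaker.

    features: list of equal-length lists of floats (one row per utterance).
    speakers: list of speaker ids, aligned with `features`.
    """
    # Group utterance indices by speaker.
    rows_by_speaker = defaultdict(list)
    for idx, spk in enumerate(speakers):
        rows_by_speaker[spk].append(idx)

    normalized = [None] * len(features)
    for spk, indices in rows_by_speaker.items():
        # Per-speaker mean and standard deviation for every feature dimension.
        columns = list(zip(*(features[i] for i in indices)))
        means = [mean(col) for col in columns]
        # Fall back to 1.0 when a dimension has zero spread for this speaker.
        stds = [pstdev(col) or 1.0 for col in columns]
        for i in indices:
            normalized[i] = [
                (x - m) / s for x, m, s in zip(features[i], means, stds)
            ]
    return normalized


# Toy data (assumed): two speakers with different baseline pitch and energy.
features = [[100.0, 0.2], [120.0, 0.4], [200.0, 0.6], [240.0, 1.0]]
speakers = ["spk_a", "spk_a", "spk_b", "spk_b"]
normalized = speaker_normalize(features, speakers)
```

After normalization, the two speakers' feature rows lie on the same scale even though their raw values differ by a factor of two, which is the effect SN is meant to have before feature selection and classifier training.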

Page No: 332 - 337

    How to Cite This Article:
    Sri Murugharaj B R, Shakthy B, Sabari L, Dr Kamaraj K. Speech Based Emotion Recognition System. ijetms;7(1):332-337. DOI: 10.46647/ijetms.2023.v07i01.050