Please use this identifier to cite or link to this item: https://gnanaganga.inflibnet.ac.in:8443/jspui/handle/123456789/2568
Title: Analysis of Speech Emotion Recognition Using Deep Learning Algorithm
Authors: Achary, Rathnakar
Naik, Manthan S
Pancholi, Tirth K
Keywords: CNN
Convolution neural network
Speech emotion recognition
Issue Date: 2023
Publisher: Intelligent Communication Technologies and Virtual Mobile Networks : Proceedings of ICICV 2022
Citation: Vol. 131; pp. 529-547
Abstract: In this project, we propose an automated system for Speech emotion recognition using convolution neural network (CNN). The system uses a 5 layer CNN model, which is trained and tested on over 7000 speech samples. The data used is.wav files of speech samples. Data required for the anlysis is gathered from RAVDESS dataset which consists of samples of speech and songs from both male and female actors. The different models of CNN were trained and tested on RAVDESS dataset until we got the required accuracy. The algorithm then classifies the given input audio file of.wav format into a range of emotions. The performance is evaluated by the accuracy of the code and also the validation accuracy. The algorithm must have minimum loss as well. The data consists of 24 actors singing and speaking in different emotions and with different intensity. The experimental results gives an accuracy of about 99.8% and a validation accuracy of 93.33% on applying the five layer model to the dataset. We get an model accuracy of 92.65%. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
URI: https://doi.org/10.1007/978-981-19-1844-5_42
http://gnanaganga.inflibnet.ac.in:8080/jspui/handle/123456789/2568
ISBN: 9789811918438
9789811918445
ISSN: 2367-4512
2367-4520
Appears in Collections:Conference Papers

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.