Please use this identifier to cite or link to this item:
https://gnanaganga.inflibnet.ac.in:8443/jspui/handle/123456789/2568
Title: | Analysis of Speech Emotion Recognition Using Deep Learning Algorithm |
Authors: | Achary, Rathnakar Naik, Manthan S Pancholi, Tirth K |
Keywords: | CNN Convolution neural network Speech emotion recognition |
Issue Date: | 2023 |
Publisher: | Intelligent Communication Technologies and Virtual Mobile Networks : Proceedings of ICICV 2022 |
Citation: | Vol. 131; pp. 529-547 |
Abstract: | In this project, we propose an automated system for Speech emotion recognition using convolution neural network (CNN). The system uses a 5 layer CNN model, which is trained and tested on over 7000 speech samples. The data used is.wav files of speech samples. Data required for the anlysis is gathered from RAVDESS dataset which consists of samples of speech and songs from both male and female actors. The different models of CNN were trained and tested on RAVDESS dataset until we got the required accuracy. The algorithm then classifies the given input audio file of.wav format into a range of emotions. The performance is evaluated by the accuracy of the code and also the validation accuracy. The algorithm must have minimum loss as well. The data consists of 24 actors singing and speaking in different emotions and with different intensity. The experimental results gives an accuracy of about 99.8% and a validation accuracy of 93.33% on applying the five layer model to the dataset. We get an model accuracy of 92.65%. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. |
URI: | https://doi.org/10.1007/978-981-19-1844-5_42 http://gnanaganga.inflibnet.ac.in:8080/jspui/handle/123456789/2568 |
ISBN: | 9789811918438 9789811918445 |
ISSN: | 2367-4512 2367-4520 |
Appears in Collections: | Conference Papers |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.