Speech Emotion Recognition Using Convolutional Neural Network (CNN)

Authors

  • Apoorv Singh NILL Author
  • Kshitij Kumar Srivastava NILL Author
  • Harini Murugan NILL Author

DOI:

https://doi.org/10.61841/xfmw8f98

Keywords:

Speech emotion, Deep learning, Tensor flow, CNN

Abstract

The Automated Speech Emotion Recognition is a tough process because of the gap among acoustic characteristics and human emotions, which depends strongly on the discriminative acoustic characteristics extracted for a provided recognition task. Different persons have different emotions and altogether a different way to express it. Speech emotion do have different energies, pitch variations are emphasized if considering different subjects. Therefore, the speech emotion detection is a demanding task in computing vision. Here, the speech emotion recognition is based on the Convolutional Neural Network (CNN) algorithm which uses different modules for the emotion recognition and the classifiers are used to differentiate emotions such as happiness, surprise, anger, neutral state, sadness, etc. The dataset for the speech emotion recognition system is the speech samples and the characteristics are extracted from these speech samples using LIBROSA package. The classification performance is based on extracted characteristics. Finally we can determine the emotion of speech signal.

 

Downloads

Download data is not yet available.

Published

31.10.2020

How to Cite

Singh, A., Srivastava, K. K., & Murugan, H. (2020). Speech Emotion Recognition Using Convolutional Neural Network (CNN). International Journal of Psychosocial Rehabilitation, 24(8), 2408-2416. https://doi.org/10.61841/xfmw8f98