TY  - JOUR
AU  - Tumanyan, Narek T.
PY  - 2021/12/14
Y2  - 2024/03/28
TI  - Deep Learning Approaches for Voice Emotion Recognition Using Sentiment-Arousal Space
JF  - Mathematical Problems of Computer Science
JA  - MPCS
VL  - 56
IS  - 
SE  - Articles
DO  - 10.51408/1963-0077
UR  - http://mpcs.sci.am/index.php/mpcs/article/view/700
SP  - 35-47
AB  - In this paper, we present deep learning-based approaches for the task of emotion recognition in voice recordings. A key component of the methods is the representation of emotion categories in a sentiment-arousal space and the usage of this space representation in the supervision signal. Our methods use wavelet and cepstral features as efficient data representations of audio signals. Convolutional Neural Network (CNN) and Long Short Term Memory Network (LSTM) architectures were used in recognition tasks, depending on whether the audio representation was treated as a spatial signal or as a temporal signal. Various recognition approaches were used, and the results were analyzed.
ER  - 