Skip to content

Latest commit

 

History

History
25 lines (14 loc) · 976 Bytes

README.md

File metadata and controls

25 lines (14 loc) · 976 Bytes

#1D Speech Emotion Recognition

Speech Emotion Recognition using raw speech signals from the EmoDB database using 1D CNN-LSTM architecture as given in the following paper.

Zhao, Jianfeng, Xia Mao, and Lijiang Chen. "Speech emotion recognition using deep 1D & 2D CNN LSTM networks." Biomedical Signal Processing and Control 47 (2019): 312-323.

EmoDB database can be downloaded from the following website

http://www.emodb.bilderbar.info/download/

There are 7 emotional classes and the validation accuracy obtained is ~61% as mentioned in the paper.

Developed and tested on the following:

Python 2.7 keras 2.2.4 Librosa 0.6.2

Update: 07-05-2021

Modifed the cnn1d.py architecture with attention mechanism (cnn1d_attn.py), shows better performance in terms of accuracy (66%-70% with lr 0.01).

##################################

#For another computational paralinguistic task, verbal conflict intensity estimation, see the repo https://github.com/smartcameras/ConflictNET