- Project Mentor
- Dr Uthayasanker Thayasivam
- Contributors
- Piruntha Navanesan
- Jarsigan Vickneswaran
- Vahesan Vijayaratnam
We are developing an effective and efficient system that recognizes emotions in conversational texts. Effectiveness refers to the high accuracy of the results, while efficiency means delivering the best possible results from a comparatively small data set.
We obtained a well-structured data set from Microsoft through the “EmoContext” competition. The data collection process followed by the competition organizers is explained at the link below.
Source of Dataset:
https://competitions.codalab.org/competitions/19790#learn_the_details-data-set-format
- The models were trained using Keras with a TensorFlow backend.
- TensorFlow
- Keras
- NLTK
- Python 3.5 or above
- OS: Ubuntu
We use a customized fastText embedding as the pre-trained embedding model. It is a context-free word embedding trained on 322M mostly emotion-related tweets. Because fastText works with character n-grams, it generates better embeddings for rare words, and even for words not seen during training.
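As an illustration of how such an embedding can be wired into Keras, here is a minimal sketch that builds an embedding matrix from a fastText `.vec` file. The file name, embedding dimension, and helper name are assumptions made for the example, not values taken from this project.

```python
# Minimal sketch: load a fastText .vec file into a Keras embedding matrix.
# EMBEDDING_FILE and EMBEDDING_DIM are hypothetical values, not the
# project's actual settings.
import numpy as np
from keras.preprocessing.text import Tokenizer

EMBEDDING_FILE = "emotion_fasttext.vec"  # assumed file name
EMBEDDING_DIM = 300                      # assumed vector size

def build_embedding_matrix(tokenizer: Tokenizer) -> np.ndarray:
    """Map every word in the tokenizer's vocabulary to its fastText vector."""
    vectors = {}
    with open(EMBEDDING_FILE, encoding="utf-8") as f:
        next(f)  # .vec files begin with a "<vocab_size> <dim>" header line
        for line in f:
            parts = line.rstrip().split(" ")
            vectors[parts[0]] = np.asarray(parts[1:], dtype="float32")

    # Rows stay zero for words the embedding file does not cover.
    matrix = np.zeros((len(tokenizer.word_index) + 1, EMBEDDING_DIM))
    for word, index in tokenizer.word_index.items():
        if word in vectors:
            matrix[index] = vectors[word]
    return matrix
```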
1. Install all necessary requirements.
2. Download the source code from GitHub and place it in a folder.
3. Download a pre-trained word embedding and add it to the same folder.
4. Specify the embedding file name in the baseline file.
5. Run the model with the following command:
python baseline_with_eval_With_Nltk.py -config testBaseline.config
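Here, the `-config` flag points the script at its parameter file. As a rough, hypothetical sketch of how such a flag could be parsed (the project's actual argument handling may differ):

```python
# Hypothetical sketch of how baseline_with_eval_With_Nltk.py might read
# its -config argument; the actual parsing in the project may differ.
import argparse
import configparser

parser = argparse.ArgumentParser(description="Emotion analysis baseline")
parser.add_argument("-config", required=True,
                    help="Path to the parameter file, e.g. testBaseline.config")
args = parser.parse_args()

config = configparser.ConfigParser()
config.read(args.config)  # main parameters (paths, hyperparameters, etc.)
```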
File Name | Description |
---|---|
baseline_with_eval_With_Nltk.py | Contains the core code for the model |
testBaseline.config | Contains the main parameters |
Train.txt | Contains the training data |
Devwithoutlabels.txt | Contains the test data |
SolFile.txt | Contains the result data |
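For orientation, in the EmoContext data format each line of `Train.txt` is, to the best of our knowledge, a tab-separated record of a conversation id, three conversation turns, and an emotion label. A small reading sketch under that assumption (the function name is hypothetical):

```python
# Sketch for reading EmoContext-style training data, assuming each line is
# a tab-separated record: id, turn1, turn2, turn3, label (with a header row).
import csv

def read_train_file(path="Train.txt"):
    """Yield ((turn1, turn2, turn3), label) pairs from the training file."""
    with open(path, encoding="utf-8") as f:
        reader = csv.reader(f, delimiter="\t")
        next(reader)  # skip the header row, if present
        for row in reader:
            conv_id, turn1, turn2, turn3, label = row
            yield (turn1, turn2, turn3), label
```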
Emotion | Precision | Recall | Micro F1 |
---|---|---|---|
Happy | 0.696 | 0.750 | 0.722 |
Sad | 0.472 | 0.760 | 0.751 |
Angry | 0.716 | 0.795 | 0.754 |
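For reference, a per-emotion F1 score is the harmonic mean of that emotion's precision and recall:

```python
# F1 is the harmonic mean of precision and recall.
def f1(precision: float, recall: float) -> float:
    return 2 * precision * recall / (precision + recall)

print(round(f1(0.696, 0.750), 3))  # Happy row: 0.722
```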
- Emoji prediction is weak in our model.
- Overfitting on some emotion-related words.
- Censored words are not handled.
- Achieved a best micro F1 score of 0.7420, which betters the third-quartile value of 0.7317 and places our model in the top quarter of the EmoContext competition leaderboard.
- When the models are ranked by recall for the happy emotion, our model outperforms all the others.
- Provides a simple and easily referable emotion prediction model for future researchers.
Emotion Analysis for Conversational Texts
Apache License 2.0
Please read our code of conduct document here.