How Do You Perceive My Face? Recognizing Facial Expressions in Multi-Modal Context by Modeling Mental Representations

Florian Blume*, Runfeng Qu*, Pia Bideau, Martin Maier, Rasha Abdel Rahman, Olaf Hellwich
Link to paper: https://arxiv.org/abs/2409.02566
- Clone the repository:
git clone [todo]
- Create the Anaconda environment:
cd context_matters
conda env create -f environment.yaml
- For evaluation and testing, we provide pre-trained models on RAVDESS and MEAD.
Training the entire framework consists of three steps.
Step 1: In Residual_vaegan.yaml, set data_path to the path of your dataset and in_channel to match your input data, then run
python run.py --config configs/Residual_vaegan.yaml --command fit
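The two fields mentioned above might look roughly as follows; the exact key names and nesting in configs/Residual_vaegan.yaml may differ, so treat this as a sketch:

```yaml
# Sketch of configs/Residual_vaegan.yaml; key names, nesting, and paths are assumptions.
data_path: /path/to/your/dataset   # root folder of RAVDESS or MEAD
in_channel: 3                      # channels of the input data, e.g. 3 for RGB frames
```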
Step 2: In MLP_classify.yaml, set ckpt to the path of the face reconstruction model you obtained in step 1, then run
python run.py --config configs/MLP_classify.yaml --command fit
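Again as a sketch (the actual key may be nested differently in configs/MLP_classify.yaml, and the checkpoint path is hypothetical):

```yaml
# Sketch of configs/MLP_classify.yaml; nesting and path are assumptions.
ckpt: /path/to/step1_checkpoints/last.ckpt   # face reconstruction model trained in step 1
```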
Step 3: In ia_attention_class.yaml, fill in the ckpts of backbone_1, backbone_2, and classifier with the models you trained in the previous two steps. Finally, run
python run.py --config configs/ia_attention_class.yaml --command fit
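The three checkpoint entries could be organized like this; the real structure of configs/ia_attention_class.yaml may differ and the paths are placeholders:

```yaml
# Sketch of configs/ia_attention_class.yaml; key layout and paths are assumptions.
backbone_1:
  ckpt: /path/to/checkpoints/backbone_1.ckpt   # a checkpoint trained in steps 1-2
backbone_2:
  ckpt: /path/to/checkpoints/backbone_2.ckpt   # a checkpoint trained in steps 1-2
classifier:
  ckpt: /path/to/checkpoints/classifier.ckpt   # a checkpoint trained in steps 1-2
```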
For evaluation, edit the ckpt in ia_attention_class.yaml to point to the model you trained or to the one we provide, and run
python run.py --config configs/ia_attention_class.yaml --command eval
It will print the validation accuracy, followed by the test accuracy.
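For reference, the evaluation entry might look like this (again a sketch with a hypothetical path):

```yaml
# Sketch; the eval run is assumed to load a single full-model checkpoint.
ckpt: /path/to/ia_attention_class/last.ckpt   # your trained model or the provided one
```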
You can select an image and an audio file from the provided dataset and produce the merged face image with the following command:
python interface.py --image path/to/image --audio path/to/audio