Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene

Dataset & Features

Datasets：The proposed AVEPR and AVELPR datasets can be download from Google Drive link
Features
extract frame-level image from videos:

python extract_frames.py

extract 2D frame-level features:

python extract_3D_feat.py

extract 3D snippet-level features:

python extract_rgb_feat.py

extract audio samples from videos:

python extract_audio.py

extract audio features:

python wave audio_feature_extractor.py

(You can also extract audio features with PANNs)

conda env create -f IAVF.yml

Training

python main.py --gpu 0 --K 1 --dataset AVE --mode train --batch-size 32 --epochs 10

Testing

python main.py --gpu 0 --K 1 --dataset AVE --mode test --batch-size 32 --epochs 10

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
feature_extractor		feature_extractor
figs		figs
nets		nets
scripts		scripts
utils		utils
IAVF.yml		IAVF.yml
README.md		README.md
dataloader.py		dataloader.py
dataloader.pyc		dataloader.pyc
main.py		main.py
util_new_oldsample.py		util_new_oldsample.py