Skip to content

[IJCV] AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation

Notifications You must be signed in to change notification settings

UARK-AICV/AOE-Net

Repository files navigation

AOE-Net

PWC PWC

Source code of paper: "AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation", which is accepted for publication in International Journal of Computer Vision.

Installation Guide

conda create -n AOENet python=3.8
conda activate AOENet
git clone https://github.com/UARK-AICV/AOE-Net.git
cd AOE-Net
conda install pytorch torchvision torchaudio cudatoolkit=10.2 -c pytorch
pip install tqdm pandas tensorboard matplotlib fvcore scipy

Download Features

We use CLIP to extract object texts of the center-frame of every snippet, for both ActivityNet-1.3 and THUMOS-14. and store there embedding features. Please download the features of the desired dataset and modify config file to point to respective directories that contain the features.

ActivityNet-1.3

3D Resnet-50 features extracted from rescaled videos of ActivityNet-1.3 can be downloaded below:

THUMOS-14

TSN are applied for environment features and 3D Resnet-50 are applied for actors feature. All features can be downloaded below:

  • Environment features are here.
  • Actor features are here.
  • Object features are here.

Training and Testing of AOE-Net

Default configurations of AOE-Net are stored in config/defaults.py. The modified configurations are stored in config/*.yaml for training and testing of AOE on different datasets (ActivityNet-1.3 and THUMOS-14). We can also modify configurations through command-line arguments.

  1. To train AOE-Net on TAPG task of ActivityNet-1.3 with 1 GPU:
python main.py --cfg-file config/anet_proposal_CLIP_v1.yaml MODE 'training' GPU_IDS [0]
  1. To evaluate AOE-Net on validation set of ActivityNet-1.3 with 1 GPU:
python main.py --cfg-file config/anet_proposals_CLIP_v1.yaml MODE 'validation' GPU_IDS [0]

Reference

This implementation is partly based on this pytorch-implementation of BMN for the boundary matching module and our previous work, named AEI.

paper: AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation (link will be updated soon).

Q&A

1q. "UserWarning: This DataLoader will create 12 worker processes in total. Our suggested max number of worker in current system is #, which is smaller than what this DataLoader is going to create. Please be aware that excessive worker creation might get DataLoader running slow or even freeze, lower the worker number to avoid potential slowness/freeze if necessary."

1a. Change num_workers to # in line 171 of root>main.py>inference function

Citation

If you find AEI useful for your research, please consider citing:

@article{vo2021aei,
 author = {Vo, Khoa and Joo, Hyekang and Yamazaki, Kashu and Truong, Sang and Kitani, Kris and Tran, Minh-Triet and Le, Ngan},
 journal = {BMVC},
 title = {{{AEI}: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation}},
 year = {2021}
}

and

@article{vo2022aoe,
author={Vo, Khoa
and Truong, Sang
and Yamazaki, Kashu
and Raj, Bhiksha
and Tran, Minh-Triet
and Le, Ngan},
title={AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation},
journal={International Journal of Computer Vision},
year={2022},
month={Oct},
day={28},
issn={1573-1405},
doi={10.1007/s11263-022-01702-9},
url={https://doi.org/10.1007/s11263-022-01702-9}
}

Contact

Khoa Vo:

About

[IJCV] AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Languages