Source code of paper: "AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation", which is accepted for publication in International Journal of Computer Vision.
conda create -n AOENet python=3.8
conda activate AOENet
git clone https://github.com/UARK-AICV/AOE-Net.git
cd AOE-Net
conda install pytorch torchvision torchaudio cudatoolkit=10.2 -c pytorch
pip install tqdm pandas tensorboard matplotlib fvcore scipy
We use CLIP to extract object texts of the center-frame of every snippet, for both ActivityNet-1.3 and THUMOS-14. and store there embedding features. Please download the features of the desired dataset and modify config file to point to respective directories that contain the features.
3D Resnet-50 features extracted from rescaled videos of ActivityNet-1.3 can be downloaded below:
- Environment features are here.
- Actor features are here.
- Object features are here.
- Annotations of Activitynet-1.3 can be downloaded from the official website.
TSN are applied for environment features and 3D Resnet-50 are applied for actors feature. All features can be downloaded below:
Default configurations of AOE-Net are stored in config/defaults.py. The modified configurations are stored in config/*.yaml for training and testing of AOE on different datasets (ActivityNet-1.3 and THUMOS-14). We can also modify configurations through command-line arguments.
- To train AOE-Net on TAPG task of ActivityNet-1.3 with 1 GPU:
python main.py --cfg-file config/anet_proposal_CLIP_v1.yaml MODE 'training' GPU_IDS [0]
- To evaluate AOE-Net on validation set of ActivityNet-1.3 with 1 GPU:
python main.py --cfg-file config/anet_proposals_CLIP_v1.yaml MODE 'validation' GPU_IDS [0]
This implementation is partly based on this pytorch-implementation of BMN for the boundary matching module and our previous work, named AEI.
paper: AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation (link will be updated soon).
1q. "UserWarning: This DataLoader will create 12 worker processes in total. Our suggested max number of worker in current system is #, which is smaller than what this DataLoader is going to create. Please be aware that excessive worker creation might get DataLoader running slow or even freeze, lower the worker number to avoid potential slowness/freeze if necessary."
1a. Change num_workers to # in line 171 of root>main.py>inference function
If you find AEI useful for your research, please consider citing:
@article{vo2021aei,
author = {Vo, Khoa and Joo, Hyekang and Yamazaki, Kashu and Truong, Sang and Kitani, Kris and Tran, Minh-Triet and Le, Ngan},
journal = {BMVC},
title = {{{AEI}: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation}},
year = {2021}
}
and
@article{vo2022aoe,
author={Vo, Khoa
and Truong, Sang
and Yamazaki, Kashu
and Raj, Bhiksha
and Tran, Minh-Triet
and Le, Ngan},
title={AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation},
journal={International Journal of Computer Vision},
year={2022},
month={Oct},
day={28},
issn={1573-1405},
doi={10.1007/s11263-022-01702-9},
url={https://doi.org/10.1007/s11263-022-01702-9}
}
Khoa Vo: