ViP

Implementation for "Enhancing Sentence Representation with Visually-supervised Multimodal Pre-training"

Requirements

Python (>=3.5)
torch (>=1.1.0)
transformers (>=2.3.0)
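
The minimum versions above can be checked before running anything. This is a minimal sketch and not part of the repository: the helper names (`version_tuple`, `meets_minimum`, `check_environment`) are hypothetical, and it assumes both packages expose a `__version__` string.

```python
import re
import sys

# Minimum versions copied from the Requirements list above.
MINIMUMS = {"torch": "1.1.0", "transformers": "2.3.0"}


def version_tuple(version, width=3):
    """Turn '1.13.1+cu117' into (1, 13, 1); pad short versions with zeros."""
    parts = []
    for piece in str(version).split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    parts = parts[:width]
    return tuple(parts + [0] * (width - len(parts)))


def meets_minimum(installed, minimum):
    """True if the installed version is at least the minimum."""
    return version_tuple(installed) >= version_tuple(minimum)


def check_environment():
    """Return a list of human-readable problems (empty if all is fine)."""
    problems = []
    if sys.version_info < (3, 5):
        problems.append("Python >= 3.5 is required")
    for name, minimum in MINIMUMS.items():
        try:
            module = __import__(name)
        except ImportError:
            problems.append("{} is not installed".format(name))
            continue
        installed = getattr(module, "__version__", "0")
        if not meets_minimum(installed, minimum):
            problems.append(
                "{} {} found, >= {} required".format(name, installed, minimum)
            )
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print(problem)
```

An empty result from `check_environment()` only means the version floors are met; it does not guarantee compatibility with much newer releases.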

Get Required Data

Flickr30K

Data Preprocessing

# For Flickr30K
cd datasets
python split_flickr_data.py


# ViP Pretraining
python vip_pretraining.py --cfg cfg/pretrain-flickr-resnet.yml

For SNLI

python unsupervised_nli.py --cfg cfg/unsupervised/snli.yml
python snli_unsupervised.py --data_folder ViP/unsupervised/flickr-resnet/snli

For RTE

python unsupervised_nli.py --cfg cfg/unsupervised/rte.yml
python snli_unsupervised.py --data_folder ViP/unsupervised/flickr-resnet/rte

For QNLI

python unsupervised_nli.py --cfg cfg/unsupervised/qnli.yml
python snli_unsupervised.py --data_folder ViP/unsupervised/flickr-resnet/qnli

For MNLI

python unsupervised_nli.py --cfg cfg/unsupervised/mnli.yml
python snli_unsupervised.py --data_folder ViP/unsupervised/flickr-resnet/mnli

For MNLI-mm

python unsupervised_nli.py --cfg cfg/unsupervised/mnli-mm.yml
python snli_unsupervised.py --data_folder ViP/unsupervised/flickr-resnet/mnli-mm

For MRPC

python unsupervised_nli.py --cfg cfg/unsupervised/mrpc.yml
python snli_unsupervised.py --data_folder ViP/unsupervised/flickr-resnet/mrpc

For QQP

python unsupervised_nli.py --cfg cfg/unsupervised/qqp.yml
python snli_unsupervised.py --data_folder ViP/unsupervised/flickr-resnet/qqp
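
The per-task commands above differ only in the task name, so they can be generated rather than typed one by one. The following is a sketch, not a script shipped with the repository: `TASKS`, `commands_for`, and `render` are hypothetical names, and the paths are copied from the commands above.

```python
import shlex

# Task names taken from the per-task commands above.
TASKS = ["snli", "rte", "qnli", "mnli", "mnli-mm", "mrpc", "qqp"]


def commands_for(task):
    """Return the two argument lists documented above for one task."""
    return [
        ["python", "unsupervised_nli.py",
         "--cfg", "cfg/unsupervised/{}.yml".format(task)],
        ["python", "snli_unsupervised.py",
         "--data_folder", "ViP/unsupervised/flickr-resnet/{}".format(task)],
    ]


def render(tasks=TASKS):
    """Render every command as a shell line, useful as a dry run."""
    lines = []
    for task in tasks:
        for argv in commands_for(task):
            lines.append(" ".join(shlex.quote(arg) for arg in argv))
    return lines


if __name__ == "__main__":
    print("\n".join(render()))
```

To execute instead of print, each list from `commands_for(task)` could be passed to `subprocess.run(argv, check=True)` from the repository root, so that a failure in the first step stops the second.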


