Code for the CS 536: Machine Learning course project at Rutgers University, inspired by the CVPR paper *Learning Cross-Modal Embeddings for Cooking Recipes and Food Images*.
- Built a system that retrieves food recipes from images of the corresponding dishes
- Beat the CCA baseline's top-10 recall for recipe retrieval in the original im2recipe paper by 20 percentage points, using ResNet and BERT feature extractors and introducing cross-modality through a shared embedding layer (see the sketch after this list)
- Implemented a second approach using neural networks trained with triplet loss, attaining a median retrieval rank of 1 and a top-10 recall of 82.49% on 1,000 random food images
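A rough illustration of the two ideas combined: project pre-extracted ResNet and BERT features into a shared space and train with triplet loss using random negatives. This is a minimal sketch, not the project's exact architecture; the embedding dimension, margin, and layer sizes are assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SharedEmbedding(nn.Module):
    """Projects pre-extracted ResNet image features (2048-d) and BERT
    recipe features (768-d) into a common space (512-d is an assumed size)."""
    def __init__(self, img_dim=2048, txt_dim=768, emb_dim=512):
        super().__init__()
        self.img_proj = nn.Linear(img_dim, emb_dim)
        self.txt_proj = nn.Linear(txt_dim, emb_dim)

    def forward(self, img_feats, txt_feats):
        # L2-normalize so cosine similarity reduces to a dot product
        img_emb = F.normalize(self.img_proj(img_feats), dim=-1)
        txt_emb = F.normalize(self.txt_proj(txt_feats), dim=-1)
        return img_emb, txt_emb

model = SharedEmbedding()
criterion = nn.TripletMarginLoss(margin=0.3)  # margin value is an assumption

# One training step with random negative sampling: the anchor is an image
# embedding, the positive its paired recipe, and the negative a recipe
# shuffled in from elsewhere in the batch (a full implementation would
# skip the rare case where the shuffle maps an index to itself).
img_feats, txt_feats = torch.randn(32, 2048), torch.randn(32, 768)
img_emb, txt_emb = model(img_feats, txt_feats)
neg_emb = txt_emb[torch.randperm(32)]
loss = criterion(img_emb, txt_emb, neg_emb)
loss.backward()
```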
- Python 3
- PyTorch
- Hugging Face
- seaborn
You can access the report here and the presentation here.
- Learning Cross-Modal Embeddings for Cooking Recipes and Food Images
- Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images
- Cross-Modal Retrieval and Synthesis (X-MRS): Closing the Modality Gap in Shared Representation Learning
- MCEN: Bridging Cross-Modal Gap between Cooking Recipes and Dish Images with Latent Variable Model
- Transformer Decoders with MultiModal Regularization for Cross-Modal Food Retrieval
- "Step1.ipynb": Contains code and retrieval results using the improved representation learning pipeline and CCA.
- "Step2a.ipynb": Contains code pertaining to the neural network training and evaluation when trained with MSE Loss.
- "Step2b.ipynb": Contains similar code but for networks trained using Triplet Loss with random negative sampling.
- "Viz.ipynb": contains code and visualizations seen in the report.
Note: "Step-3.ipynb" is extra part of the project that aimed to enhance the explanibility of the retrieval system by leveraging attention mechanisms in ViT and BERT
Made as a team with @Neil-98, Aishwarya Harpale, and Linqi Xiao.