visual-question-anwsering

Here are 3 public repositories matching this topic...

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

deep-learning salesforce image-captioning deep-learning-library vision-framework vision-and-language multimodal-deep-learning multimodal-datasets vision-language-transformer vision-language-pretraining visual-question-anwsering

Updated Nov 18, 2024
Jupyter Notebook

nbtin / qa_web_demo

Star

A web app for both Text-based and Visual Question Answering.

machine-learning django deep-learning question-answering visual-question-anwsering

Updated Nov 13, 2023
Python

yousefkotp / Visual-Question-Answering

Star

A Light weight deep learning model with with a web application to answer image-based questions with a non-generative approach for the VizWiz grand challenge 2023 by carefully curating the answer vocabulary and adding linear layer on top of Open AI's CLIP model as image and text encoder

machine-learning deep-learning vqa clip text-encoding image-and-text visual-question-answering vqa-dataset image-encoding vizwiz clip-model vizwiz-vqa visual-question-anwsering open-ai-clip vqa-2023

Updated Jun 27, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the visual-question-anwsering topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the visual-question-anwsering topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly