LAVIS - A One-stop Library for Language-Vision Intelligence
Updated Nov 18, 2024 - Jupyter Notebook
A web app for both Text-based and Visual Question Answering.
A lightweight deep learning model with a web application that answers image-based questions using a non-generative approach, built for the VizWiz Grand Challenge 2023. It carefully curates the answer vocabulary and adds a linear layer on top of OpenAI's CLIP model, which serves as both the image and the text encoder.
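The non-generative approach described above can be illustrated with a minimal sketch: the model picks an answer from a fixed vocabulary rather than generating text. This is not the repository's code. The function names, the frequency-based vocabulary curation, the concatenation of the two embeddings, and the tiny embedding size are all assumptions made for illustration, and random vectors stand in for real CLIP image and text embeddings.

```python
import random
from collections import Counter


def build_answer_vocab(answers, max_size):
    """Keep the most frequent normalized answers as the fixed output classes.
    (Assumed curation rule; the project may curate differently.)"""
    counts = Counter(a.strip().lower() for a in answers)
    return [answer for answer, _ in counts.most_common(max_size)]


def linear_head(features, weights, bias):
    """A single linear layer: one score per vocabulary entry."""
    return [
        sum(w * x for w, x in zip(row, features)) + b
        for row, b in zip(weights, bias)
    ]


def predict(image_emb, text_emb, weights, bias, vocab):
    """Classify over the vocabulary instead of generating an answer."""
    features = image_emb + text_emb  # assumed fusion: list concatenation
    scores = linear_head(features, weights, bias)
    best = max(range(len(vocab)), key=scores.__getitem__)
    return vocab[best]


rng = random.Random(0)
train_answers = ["yes", "no", "Yes", "unanswerable", "blue", "yes"]
vocab = build_answer_vocab(train_answers, max_size=3)

dim = 4  # placeholder size; real CLIP embeddings are much larger
image_emb = [rng.uniform(-1, 1) for _ in range(dim)]  # stand-in for CLIP image features
text_emb = [rng.uniform(-1, 1) for _ in range(dim)]   # stand-in for CLIP text features
weights = [[rng.uniform(-1, 1) for _ in range(2 * dim)] for _ in vocab]  # untrained
bias = [0.0] * len(vocab)

answer = predict(image_emb, text_emb, weights, bias, vocab)
print(answer)
```

Because the weights here are random and untrained, the printed answer is arbitrary. The point is only that the output is always one of the curated vocabulary entries.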