Popular repositories Loading
-
meshed-memory-transformer
meshed-memory-transformer PublicMeshed-Memory Transformer for Image Captioning. CVPR 2020
-
dress-code
dress-code PublicDress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022
-
multimodal-garment-designer
multimodal-garment-designer PublicThis is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023
-
show-control-and-tell
show-control-and-tell PublicShow, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
-
novelty-detection
novelty-detection PublicLatent space autoregression for novelty detection.
Repositories
Showing 10 of 74 repositories
- ReflectiVA Public
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
aimagelab/ReflectiVA’s past year of commit activity - font_square Public
aimagelab/font_square’s past year of commit activity - aimagelab.github.io Public
aimagelab/aimagelab.github.io’s past year of commit activity