Repositories list
12 repositories
Liquid (Public)
Liquid: Language Models are Scalable Multi-modal Generators

Infinity (Public)
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

infinity.project (Public)

VAR (Public)
[NeurIPS 2024 Oral] [GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

GLEE (Public)
[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

OmniTokenizer (Public)
[NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization

vaex (Public)

Groma (Public)
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

GenerateU (Public)
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

UniRef (Public)
[ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces

.github (Public)