FoundationVision

Hi there 👋

This is the official website repo of FoundationVision.

Popular repositories

  1. VAR Public

    [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

    Python · 6.2k stars · 418 forks

  2. LlamaGen Public

    Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

    Python · 1.4k stars · 58 forks

  3. GLEE Public

    [CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

    Python · 1.1k stars · 86 forks

  4. Groma Public

    [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

    Python · 579 stars · 61 forks

  5. Infinity Public

    Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

    322 stars

  6. OmniTokenizer Public

    [NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.

    Python · 272 stars · 7 forks

Repositories

Showing 10 of 12 repositories
  • Liquid Public

    Liquid: Language Models are Scalable Multi-modal Generators

    2 stars · MIT · 0 forks · 0 issues · 0 pull requests · Updated Dec 12, 2024
  • Infinity Public

    Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

    322 stars · MIT · 0 forks · 4 issues · 0 pull requests · Updated Dec 11, 2024
  • infinity.project Public

    HTML · 0 stars · 0 forks · 0 issues · 0 pull requests · Updated Dec 11, 2024
  • VAR Public

    [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

    Python · 6,158 stars · MIT · 418 forks · 36 issues · 1 pull request · Updated Dec 6, 2024
  • GLEE Public

    [CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

    Python · 1,104 stars · MIT · 86 forks · 39 issues · 2 pull requests · Updated Oct 21, 2024
  • LlamaGen Public

    Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

    Python · 1,396 stars · MIT · 58 forks · 50 issues · 0 pull requests · Updated Aug 16, 2024
  • OmniTokenizer Public

    [NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.

    Python · 272 stars · MIT · 7 forks · 8 issues · 0 pull requests · Updated Jul 9, 2024
  • vaex Public

    🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook

    Python · 55 stars · MIT · 3 forks · 1 issue · 0 pull requests · Updated Jun 23, 2024
  • Groma Public

    [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

    Python · 579 stars · Apache-2.0 · 61 forks · 8 issues · 1 pull request · Updated Jun 7, 2024
  • GenerateU Public

    [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

    Python · 148 stars · 6 forks · 13 issues · 0 pull requests · Updated Mar 24, 2024

People

This organization has no public members.

Top languages

Python HTML