Stars
Code for Pyramidal Flow Matching for Efficient Video Generative Modeling
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks
Entropy Based Sampling and Parallel CoT Decoding
Let your Claude be able to think
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
GenEval: An object-focused framework for evaluating text-to-image alignment
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from Meta AI
Janus-Series: Unified Multimodal Understanding and Generation Models
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Anole: An Open, Autoregressive, and Native Multimodal Model for Interleaved Image-Text Generation
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
Open weights LLM from Google DeepMind.
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Evaluate your LLM's responses with Prometheus and GPT-4 💯
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Ongoing research training transformer models at scale
A curated list of resources on reinforcement learning from human feedback (continually updated)
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…
Train transformer language models with reinforcement learning.