FRX

FRX (pronounced "freaks") is a Python package for training vision models from scratch in Jax.

Motivation

I worked on BioCLIP, a foundation vision model for the entire tree of life (450K+ different species). One of our ablations was comparing our CLIP-based objective to a traditional cross entropy classification objective. I felt that we did not aggressively tune the cross entropy training hyperparameters, and that it is not a 100% perfectly fair comparison (but science is never 100% perfect--BioCLIP is still a phenomenal piece of work of which I am immensely proud). I am curious if muP (see my notes on muP) along with Jax can lead to a stronger cross-entropy baseline.

With that in mind, one concrete research goal of FRX is to train a ViT-B/16 from scratch on TreeOfLife-10M using a cross-entropy classification objective and compare it to BioCLIP on biobench.

Personally, I am also interested in learning more about:

Jax, as well as multi-GPU and multi-node training in Jax.
muP transfer, including unit-scaled muP.
Large(r)-scale vision training.

Road Map

Demonstrate muP transfer of learning rate and std dev of weight initialization on ImageNet-1K using a model with size 128, 256 and 384 (384 is a ViT-S/16).
Do the same with TreeOfLife-10M.
Experiment with extreme classification methods.

muP Transfer

To demonstrate that muP transfer actually works, we have to tune the larger model using standard parameterization as well. However, we don't want to do this for actually big models--the whole point of muP is to avoid that. So we demonstrate that muP works by tuning models with a hidden dimension of 128, 256 and 384 with 12 layers. We also do muP tuning on the d=128 model then do zero-shot muP transfer to d=384 and compare it to the manually tuned d=384.

Experimental results are described here in the docs.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
docs		docs
experiments		experiments
frx		frx
.gitignore		.gitignore
.ignore		.ignore
README.md		README.md
download.py		download.py
justfile		justfile
pyproject.toml		pyproject.toml
sweep.py		sweep.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FRX

Motivation

Further Reading

Road Map

muP Transfer

About

Languages

samuelstevens/frx

Folders and files

Latest commit

History

Repository files navigation

FRX

Motivation

Further Reading

Road Map

muP Transfer

About

Resources

Stars

Watchers

Forks

Languages