LLM Training Pipeline

This repo extends https://github.com/ZQZCalin/trainit; setup and usage instructions are available there.

Project Report: https://lucamezger.com/RSI_2024_Paper.pdf

New Functionality

  • implementation of additional optimizers
  • "cheating" version of the optimistic optimizer OFTRL as a proof of concept (POC)
  • implementation of various hint methods for optimistic optimizers (see the sketch below)
  • gradient accumulation
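
The optimistic variants share one idea: the update is computed from the gradients seen so far plus a hint that predicts the next gradient. Below is a minimal sketch of such a step, assuming a plain SGD-style update on a JAX pytree; the function name and signature are illustrative and are not the API of optimizer/oftrl.py.

import jax

def optimistic_step(params, grad_sum, hint, lr=1e-3):
    # grad_sum: gradients observed so far (e.g. a running sum or average)
    # hint: a prediction of the next gradient (the last gradient, an EMA, or the "cheating" hint)
    return jax.tree_util.tree_map(lambda p, g, h: p - lr * (g + h), params, grad_sum, hint)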

Results

Benchmark: Adam & discounted FTRL (Adam within the O2NC framework)

python train_jax.py logging.wandb_project=<project-name> logging.wandb_name=<name> optimizer=ftrl
python train_jax.py logging.wandb_project=<project-name> logging.wandb_name=<name> # set weight decay to 0.0 for a fair comparison

https://wandb.ai/optimizedlearning/log1/reports/Benchmark-Adam-discounted-FTRL--Vmlldzo5MDU1NjUw
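
As a rough illustration of what "discounted FTRL" could mean here (an assumption on our part, not the code behind optimizer=ftrl): past gradient statistics are geometrically discounted by a factor beta instead of being summed, which together with adaptive scaling yields an Adam-like step.

import jax
import jax.numpy as jnp

def discounted_ftrl_step(params, grad, state, lr=1e-3, beta=0.99, eps=1e-8):
    # discount the accumulated first/second gradient moments, then add the new gradient
    m = jax.tree_util.tree_map(lambda m_, g: beta * m_ + g, state["m"], grad)
    v = jax.tree_util.tree_map(lambda v_, g: beta * v_ + g * g, state["v"], grad)
    # Adam-like step built from the discounted statistics
    new_params = jax.tree_util.tree_map(
        lambda p, m_, v_: p - lr * m_ / (jnp.sqrt(v_) + eps), params, m, v)
    return new_params, {"m": m, "v": v}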

Hint Methods for OFTRL

python train_jax.py logging.wandb_project=<project-name> logging.wandb_name=<name> optimizer=oftrl optimizer.beta3=0.5 optimizer.hint_method=0 # hint_method ranges from 0 to 20 (see optimizer/oftrl.py); beta3 is used for the hint calculations

If you use the cheating variant, hint_method has to be a string, e.g. "cheating" (i.e. don't specify a numeric hint method, otherwise it will overwrite the actual cheating hint).

https://wandb.ai/optimizedlearning/log1/reports/Cheating-vs-Hints--Vmlldzo5MDUzODYx
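
As one example of what a hint method can look like (the actual methods 0-20 live in optimizer/oftrl.py and are not reproduced here; this EMA form is only an assumption), beta3 could weight an exponential moving average of past gradients:

import jax

def ema_hint(prev_hint, grad, beta3=0.5):
    # illustrative hint: exponential moving average of past gradients, weighted by beta3
    return jax.tree_util.tree_map(lambda h, g: beta3 * h + (1.0 - beta3) * g, prev_hint, grad)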

Cheating POC (two batches per iteration, i.e. 2x gradient evaluations)

python train_jax.py logging.wandb_project=<project-name> logging.wandb_name=cheat_oftrl optimizer=oftrl train.use_cheat_hint=True

https://wandb.ai/optimizedlearning/log1/reports/Cheating-OFTRL-POC-Adam--Vmlldzo5MDU1NzAz
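
Conceptually, the cheating hint replaces the gradient prediction with a real gradient computed on a second batch, which is why each iteration costs two gradient evaluations. A rough sketch of that idea (the names are illustrative, not the repo's training-loop code):

import jax

def cheat_step(loss_fn, params, batch, next_batch, opt_step):
    grad = jax.grad(loss_fn)(params, batch)             # regular gradient
    cheat_hint = jax.grad(loss_fn)(params, next_batch)  # extra evaluation used as the hint
    return opt_step(params, grad, cheat_hint)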

Gradient Accumulation (8 Batches)

python train_jax.py logging.wandb_project=<project-name> logging.wandb_name=8batch_cheat_oftrl optimizer=oftrl train.use_cheat_hints=True train.accumulate_gradients=True train.accumulation_steps=8 train.use_amp=False optimizer.lr_config.lr=0.0024
python train_jax.py logging.wandb_project=<project-name> logging.wandb_name=8batch_ftrl optimizer=ftrl train.use_cheat_hints=False train.accumulate_gradients=True train.accumulation_steps=8 train.use_amp=False optimizer.lr_config.lr=0.0024

https://wandb.ai/optimizedlearning/log1/reports/Batch-size-8-Cheating-OFTRL-FTRL--Vmlldzo5MDYxMTY5
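
Gradient accumulation computes one gradient per micro-batch and combines them before a single optimizer step, emulating a larger batch size. A minimal sketch of that loop (not the repo's implementation, which is toggled via train.accumulate_gradients and train.accumulation_steps):

import jax

def accumulated_grad(loss_fn, params, micro_batches):
    # one gradient per micro-batch, averaged before the single optimizer step
    grads = [jax.grad(loss_fn)(params, b) for b in micro_batches]
    return jax.tree_util.tree_map(lambda *g: sum(g) / len(g), *grads)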

Gradient Accumulation with Different Batch Sizes (Cheating OFTRL)

python train_jax.py logging.wandb_project=<project-name> logging.wandb_name=4batch_cheat_oftrl optimizer=oftrl train.use_cheat_hints=True train.accumulate_gradients=True train.accumulation_steps=4 train.use_amp=False optimizer.lr_config.lr=0.0012

https://wandb.ai/optimizedlearning/log1/reports/Different-batch-sizes-Cheating-OFTRL--Vmlldzo5MDYxMzc4
