- Performance gains in generation: `notebooks/Speedup Profiling Submission.ipynb`
- Accuracy evaluation on WikiText: `notebooks/wikitext_PPL_calculations.py`
- Accuracy evaluation on C4: `notebooks/C4_PPL_calculations.py`
All our code can be run on PACE.
- Connect to the Georgia Tech VPN.
- Access https://ondemand-ice.pace.gatech.edu/ and request a RHEL9 Interactive Desktop (from the top drop-down) with an H100 GPU.
- Enter the VM's GUI and open a terminal.
- After setting up your git credentials, run:

```bash
git clone [email protected]:abhibambhaniya/mixtral-offloading-residency-info.git
cd mixtral-offloading-residency-info
bash initial_setup.sh
conda activate $TMP_DIR/moe-offload
jupyter notebook
```

- Download the model weights into `notebooks/`:

```bash
cd notebooks
huggingface-cli download lavawolfiee/Mixtral-8x7B-Instruct-v0.1-offloading-demo --quiet --cache-dir $TMP_DIR --local-dir Mixtral-8x7B-Instruct-v0.1-offloading-demo
```
- For performance-gain results: open up the speed-up notebook (`Speedup Profiling Submission.ipynb`) and GO! :D
- For quality results on WikiText/C4, run the respective Python script. Make sure the model download above completed successfully, and verify that the `--local-dir` you downloaded to matches the `state_path` set in the script. An illustrative sketch of the perplexity evaluation follows below.
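For reference, perplexity on these datasets is typically computed with a sliding-window evaluation. The sketch below follows the standard Hugging Face perplexity recipe and is illustrative only: it assumes `model` and `tokenizer` are already loaded (the actual scripts build the offloaded Mixtral model from `state_path` first), and the window sizes and function name are ours, not the scripts'.

```python
# Illustrative sliding-window perplexity evaluation on WikiText-2.
# Assumes `model` and `tokenizer` are already constructed; the real
# scripts load the offloaded Mixtral build from state_path.
import torch
from datasets import load_dataset

def wikitext_ppl(model, tokenizer, max_length=1024, stride=512, device="cuda"):
    test = load_dataset("wikitext", "wikitext-2-raw-v1", split="test")
    encodings = tokenizer("\n\n".join(test["text"]), return_tensors="pt")
    seq_len = encodings.input_ids.size(1)

    nlls, prev_end = [], 0
    for begin in range(0, seq_len, stride):
        end = min(begin + max_length, seq_len)
        trg_len = end - prev_end          # new tokens scored in this window
        input_ids = encodings.input_ids[:, begin:end].to(device)
        target_ids = input_ids.clone()
        target_ids[:, :-trg_len] = -100   # score only new tokens; the rest is context

        with torch.no_grad():
            # Mean NLL over the scored tokens (approximate: the causal
            # shift drops one label per window).
            nll = model(input_ids, labels=target_ids).loss
        nlls.append(nll * trg_len)

        prev_end = end
        if end == seq_len:
            break
    return torch.exp(torch.stack(nlls).sum() / prev_end)
```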
Repo `MoE_Expert_Scheduler` (`huggingface/transformers` fork)
- Mixtral and Switch Transformer changes to collect router logits
- Changes to the generation utilities to return the collected router data with the generation output (see the sketch below for the upstream API this builds on)
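For context, stock transformers can already report per-layer router logits on a single forward pass; the fork's contribution is propagating this kind of data through `generate()`, which the sketch below does not show. The model ID and dtype/device settings here are illustrative:

```python
# Minimal sketch: collecting Mixtral router logits with upstream transformers.
import torch
from transformers import AutoTokenizer, MixtralForCausalLM

tokenizer = AutoTokenizer.from_pretrained("mistralai/Mixtral-8x7B-Instruct-v0.1")
model = MixtralForCausalLM.from_pretrained(
    "mistralai/Mixtral-8x7B-Instruct-v0.1",
    output_router_logits=True,   # ask every MoE layer to report its router logits
    torch_dtype=torch.float16,
    device_map="auto",
)

inputs = tokenizer("Hello, world", return_tensors="pt").to(model.device)
with torch.no_grad():
    out = model(**inputs)

# One tensor per MoE layer, each of shape (batch * seq_len, num_experts).
for layer, logits in enumerate(out.router_logits):
    print(layer, logits.shape)
```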
Repo `mixtral-offloading-residency-info` (`dvmazur/mixtral-offloading` fork)
- Implementation of biasing and thresholding for expert routing (sketched below)
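As a rough illustration of the idea (all names, defaults, and the exact rule here are our assumptions, not the repo's actual code): biasing nudges the router logits toward experts already resident on the GPU, and thresholding drops low-probability experts from the top-k so fewer experts need to be fetched from host memory.

```python
# Illustrative expert selection with biasing + thresholding.
# `resident_mask` marks experts currently cached on the GPU; names,
# defaults, and the exact rule are assumptions, not the repo's code.
import torch

def select_experts(router_logits, resident_mask, top_k=2, bias=1.0, threshold=0.2):
    # Biasing: raise the logits of GPU-resident experts so close calls
    # break toward experts that need no CPU->GPU transfer.
    biased = router_logits + bias * resident_mask.to(router_logits.dtype)

    probs = torch.softmax(biased, dim=-1)
    top_probs, top_idx = probs.topk(top_k, dim=-1)

    # Thresholding: keep lower-ranked experts only if their routing
    # probability justifies loading them; always keep the top-1 expert.
    keep = top_probs >= threshold
    keep[..., 0] = True

    # Renormalize the surviving experts' weights so they still sum to 1.
    weights = torch.where(keep, top_probs, torch.zeros_like(top_probs))
    weights = weights / weights.sum(dim=-1, keepdim=True)
    return top_idx, weights, keep

# Example: 8 experts, experts 1 and 5 resident on the GPU.
logits = torch.randn(4, 8)                 # (tokens, num_experts)
resident = torch.zeros(8)
resident[[1, 5]] = 1.0
idx, w, kept = select_experts(logits, resident)
```

The payoff in an offloading setup is skipping expert transfers whose routing weight would have been negligible anyway, at a small (tunable) cost in routing fidelity.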