i specialize in building big systems to crunch through big data and develop big models.
i previously developed one of the largest reinforcement learning systems at openai for openai five, along with the llm training infrastructure at meta ai / fair that produced opt-175b.
opt-175b was the first release in the industry to include:
- a 175b parameter model for research use
- a 114-page logbook detailing the challenges encountered during the 56 days it took to train a 175b llm on new hardware for the first time
- the entire training codebase
- a full suite of smaller-scale models ranging from 125m to 66b parameters for studying scaling laws.
before getting into ai systems, i worked on scaling out data infrastructure and processing pipelines across various cloud providers.
you can refer to my linkedin for more details on my work experience.
in recent years, i have mainly presented talks on openai five and on opt-175b:
- October 21, 2022: Scale Transform X Conference - Top Tips from Netflix, NVIDIA, and Meta on Large Language Models
- December 2, 2022: NeurIPS 2022 - Has It Trained Yet? Workshop
- March 1, 2023: Stanford MLSys Seminar Series
- April 1, 2023: CMU LLM Seminar
selected publications:
- Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
- LIMA: Less Is More for Alignment
- A Theory on Adam Instability in Large-Scale Machine Learning
- Effective Theory of Transformers at Initialization
- Scaling Laws for Generative Mixed-Modal Language Models
- OPT: Open Pre-trained Transformer Language Models
- Neural Network Surgery with Sets
- Long-Term Planning and Situational Awareness in OpenAI Five
- Dota 2 with Large Scale Deep Reinforcement Learning