The listing is in no particular order other than the year.
- BigScience pre-BLOOM 108B training experiments (2021): chronicles | the full spec and discussions (backup: 1 | 2)
-
BigScience BLOOM-176B (2022): chronicles-prequel | chronicles | the full spec and discussions (backup: 1 | 2 | 3)
-
THUDM GLM-130B (2022): en logbook | Mandarin version (backup: 1 | 2)
-
HuggingFace IDEFICS-80B multimodal (Flamingo repro) (2023): Learning log | Training Chronicles (backup: 1 | 2)
-
BloombergGPT 50B LLM - section C in BloombergGPT: A Large Language Model for Finance