GitHub - CerebrasResearch/nanoGNS: Minimal reference implementations for per-example gradient norm methods for computing GNS


Files:
approx
exact
.gitignore
README.md


This repository contains the reference implementations of two related papers:

1. "Efficient and Approximate Per-Example Gradient Norms for Gradient Noise Scale" (NeurIPS WANT Workshop 2023)
   - The code is available in the approx directory.
2. "Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in Transformers" (NeurIPS 2024) (arXiv)
   - The code is available in the exact directory.
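
For readers new to the topic, the sketch below shows one common way per-example gradient norms can feed a gradient noise scale (GNS) estimate. It is not taken from the approx or exact directories. It assumes the "simple" noise scale (gradient covariance trace divided by squared gradient norm) and the standard two-batch-size unbiased estimators, with the small batch size set to 1 so that the small-batch term is just the mean per-example squared norm. The toy linear model and the names `per_example_grads` and `gns_from_per_example_grads` are hypothetical, chosen only for illustration.

```python
import random


def sq_norm(v):
    """Squared L2 norm of a vector given as a list of floats."""
    return sum(vi * vi for vi in v)


def per_example_grads(w, xs, ys):
    """Per-example gradients of 0.5 * (w.x - y)^2 with respect to w (toy linear model)."""
    grads = []
    for x, y in zip(xs, ys):
        err = sum(wi * xi for wi, xi in zip(w, x)) - y
        grads.append([err * xi for xi in x])
    return grads


def gns_from_per_example_grads(grads):
    """Estimate (gns, trace_cov, grad_sq) from a batch of per-example gradients.

    Uses E[|G_B|^2] = |G|^2 + tr(Sigma) / B at batch sizes B and 1.
    """
    B = len(grads)
    if B < 2:
        raise ValueError("need at least two examples")
    dim = len(grads[0])
    mean_grad = [sum(g[j] for g in grads) / B for j in range(dim)]
    big = sq_norm(mean_grad)                      # |G_B|^2, norm of the batch-mean gradient
    small = sum(sq_norm(g) for g in grads) / B    # mean per-example squared norm
    grad_sq = (B * big - small) / (B - 1)         # unbiased estimate of |G|^2
    trace_cov = (small - big) / (1.0 - 1.0 / B)   # unbiased estimate of tr(Sigma)
    # The ratio is a biased, noisy estimate; grad_sq can be <= 0 on small batches.
    gns = trace_cov / grad_sq if grad_sq > 0 else float("nan")
    return gns, trace_cov, grad_sq


if __name__ == "__main__":
    random.seed(0)
    w = [0.5, -0.25]
    xs = [[random.gauss(0, 1), random.gauss(0, 1)] for _ in range(64)]
    ys = [x[0] - x[1] + random.gauss(0, 0.1) for x in xs]
    print(gns_from_per_example_grads(per_example_grads(w, xs, ys)))
```

In practice the two estimates are usually averaged over many steps before taking the ratio, since a single-batch ratio is noisy.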
