- In-context Learning and Induction Heads: This paper describes an emergent phenomenon in transformer training called induction heads. As the name indicates, an induction head is a special type of attention head that performs "induction" reasoning: it looks back for an earlier occurrence of the current token and predicts the token that followed it.
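The `[A][B] … [A] → [B]` pattern that induction heads implement can be mimicked with a few lines of plain Python. This is only an illustrative sketch of the copying behavior, not a model of the actual attention mechanism; the function name is made up for this example.

```python
def induction_predict(tokens):
    """Toy mimic of an induction head: scan backwards for the most
    recent earlier occurrence of the last token, and predict the
    token that immediately followed that occurrence."""
    last = tokens[-1]
    # Search earlier positions, most recent first (exclude the last token itself).
    for i in range(len(tokens) - 2, -1, -1):
        if tokens[i] == last:
            return tokens[i + 1]
    return None  # No earlier occurrence: the pattern gives no prediction.

# [A][B][C][A] -> predicts [B]
print(induction_predict(["A", "B", "C", "A"]))
```

A real induction head realizes this behavior with two attention heads working together (a previous-token head feeding a pattern-matching head), but the input-output behavior on repeated sequences is the same as this lookup.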
- Successor Heads: Recurring, Interpretable Attention Heads in the Wild
- Transformers generalize differently from information stored in context vs in weights
- Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?
- [The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains](https://arxiv.org/abs/2402.11004)
- [Causal scrubbing: results on induction heads](https://www.lesswrong.com/posts/j6s9H9SHrEhEfuJnq/causal-scrubbing-results-on-induction-heads)
Alialsaeedi1/In-context-learning-secrets