Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression

This is an official repository for our paper, Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression.

@inproceedings{
    fu2024transformers,
    title={Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression},
    author={Deqing Fu and Tian-qi Chen and Robin Jia and Vatsal Sharan},
    booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
    year={2024},
    url={https://openreview.net/forum?id=L8h6cozcbn}
}

Codes are mostly modified from this prior work.

Getting started

You can start by cloning our repository and following the steps below.

Install the dependencies for our code using Conda. You may need to adjust the environment YAML file depending on your setup.
```
conda env create -f environment.yml
conda activate transformers_icl_opt
```

Download model checkpoints and extract them in the current directory.

wget https://github.com/dtsip/in-context-learning/releases/download/initial/models.zip
unzip models.zip

Run probing for each Transformers layer
```
cd src
python probing.py
```
Compute Transformer's similarities to both Iterative Newton's Method and Gradient Descent
```
python eval_similarity.py
```
This will plot Fig. 1(a) and Fig. 3 in the paper, under a new folder eval.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
src		src
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression

Getting started

About

Releases

Packages

Languages

DeqingFu/transformers-icl-second-order

Folders and files

Latest commit

History

Repository files navigation

Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression

Getting started

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages