title

software

abstract

layout

series

publisher

issn

id

month

tex_title

firstpage

lastpage

page

order

cycles

bibtex_author

author

date

address

container-title

volume

genre

issued

pdf

extras

Enhancing In-context Learning via Linear Probe Calibration

https://github.com/mominabbass/LinC

In-context learning (ICL) is a new paradigm for natural language processing that utilizes Generative Pre-trained Transformer (GPT)-like models. This approach uses prompts that include in-context demonstrations to generate the corresponding output for a new query input. However, applying ICL in real cases does not scale with the number of samples, and lacks robustness to different prompt templates and demonstration permutations. In this paper, we first show that GPT-like models using ICL result in unreliable predictions based on a new metric based on Shannon entropy. Then, to solve this problem, we propose a new technique called the Linear Probe Calibration (LinC), a method that calibrates the model’s output probabilities, resulting in reliable predictions and improved performance, while requiring only minimal additional samples (as few as five labeled data samples). LinC significantly enhances the ICL test performance of GPT models on various benchmark datasets, with an average improvement of up to 21%, and up to a 50% improvement in some cases, and significantly boosts the performance of PEFT methods, especially in the low resource regime. Moreover, LinC achieves lower expected calibration error, and is highly robust to varying label proportions, prompt templates, and demonstration permutations.

inproceedings

Proceedings of Machine Learning Research

PMLR

2640-3498

abbas24a

0

Enhancing In-context Learning via Linear Probe Calibration

307

315

307-315

307

false

Abbas, Momin and Zhou, Yi and Ram, Parikshit and Baracaldo, Nathalie and Samulowitz, Horst and Salonidis, Theodoros and Chen, Tianyi

given	family
Momin	Abbas

given	family
Yi	Zhou

given	family
Parikshit	Ram

given	family
Nathalie	Baracaldo

given	family
Horst	Samulowitz

given	family
Theodoros	Salonidis

given	family
Tianyi	Chen

2024-04-18

Proceedings of The 27th International Conference on Artificial Intelligence and Statistics

238

inproceedings

date-parts

2024

4

18

https://proceedings.mlr.press/v238/abbas24a/abbas24a.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

2024-04-18-abbas24a.md

2024-04-18-abbas24a.md

Files

2024-04-18-abbas24a.md

Latest commit

History

2024-04-18-abbas24a.md

File metadata and controls