Fast Few-shot Debugging for NLU Test Suites

This repository contains the test suites used in the paper "Fast Few-shot Debugging for NLU Test Suites". They are derived from test suites in HANS by Tom McCoy et al. and CheckList by Marco Ribeiro et al.

For each test suite, train.jsonl contains the ten examples used for debugging. The remainder of the test suite, in test.jsonl, is used for testing. The file train-plus.jsonl contains the ten debugging examples together with up to twenty newly misclassified examples collected from the original training set (MultiNLI, Quora Question Pairs, or SST-2).
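As a rough illustration, these files can be read one line at a time with Python's standard json module. The sketch below assumes only that each non-empty line holds one JSON value. The helper name read_jsonl is hypothetical, and because the record fields are not described here, the code does not rely on any particular field.

```python
import json
from pathlib import Path


def read_jsonl(path):
    """Return a list with one parsed JSON value per non-empty line of `path`."""
    records = []
    with Path(path).open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines, such as a trailing newline
                records.append(json.loads(line))
    return records
```

Calling read_jsonl on a suite's train.jsonl should then give a list of ten records, one per debugging example.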

Test extraction tool for CheckList

We also include scripts for extracting individual debugging examples from the Pickle files included in the CheckList repository. To run them, first install CheckList in your Python path, following the instructions in the CheckList repository. The CheckList source tree contains Pickle files with the original test suites in the release_suites directory. From that source tree, you may collect the ten debugging examples and the remaining test examples for each suite with

mkdir qqp-suites
python src/qqp-tests.py release_suites/qqp_suite.pkl qqp-suites
mkdir sst2-suites
python src/sst2-tests.py release_suites/sentiment_suite.pkl sst2-suites

This procedure reproduces the training and testing files included in this repository. The repository itself includes only the suites on which accuracy before debugging is worst.
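One way to check a regenerated file against the copy in this repository is to compare the two as collections of JSON records. The sketch below is an assumption-laden illustration, not part of the released scripts: the function names are hypothetical, and because it is not stated whether the scripts emit examples in the same order or with the same key order, the comparison ignores both.

```python
import json
from collections import Counter


def canonical_lines(path):
    """Count each record in a JSONL file, serialized with sorted keys."""
    counts = Counter()
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            if line.strip():  # ignore blank lines
                counts[json.dumps(json.loads(line), sort_keys=True)] += 1
    return counts


def same_examples(path_a, path_b):
    """True if both files hold the same records, regardless of line order."""
    return canonical_lines(path_a) == canonical_lines(path_b)
```

Pass the path of a freshly extracted file and the path of the corresponding file in this repository; a result of False means the two differ in at least one record.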

Citation

If you use our few-shot test suites, please cite our ACL 2022 workshop paper:

@inproceedings{debugging,
  title={Fast Few-shot Debugging for NLU Test Suites},
  author={Malon, Christopher and Li, Kai and Kruus, Erik},
  booktitle={Proceedings of Deep Learning Inside Out (DeeLIO): The 3rd Workshop on Knowledge Extraction and Integration for Deep Learning Architectures},
  address={Dublin, Ireland},
  year={2022}
}
