Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

List of corpora #255

Open
nschneid opened this issue Jun 20, 2023 · 0 comments
Open

List of corpora #255

nschneid opened this issue Jun 20, 2023 · 0 comments
Assignees

Comments

@nschneid
Copy link
Contributor

Enough AMR corpora are becoming available even for English that it is hard to keep track of all of them, and it is not always easy to find them based on the publication entries in our bibliography.

It may be time for a separate page listing corpora. This could start as a Google Sheet, with columns for language, dataset name, dialect (original, Dialogue-AMR, etc.), size of annotated data (tokens, AMRs), URL, and a reference to the publication entry.

Basically the principle would be: the Bibliography page lists things to read/cite, and the Corpus List page lists corpora to download.

@nschneid nschneid self-assigned this Jun 20, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant