Ensembled Corpus from the eHealth-KD 2019 Challenge

The corpus consists of 8000 automatically annotated sentences. It is organized as follows:

data/ensemble.txt, the collection of plain text sentences; one sentence per line.
data/ensemble.ann, the collection of annotations in brat standoff format (https://brat.nlplab.org/standoff.html).
data/ensemble.scr, the collection of agreement scores between the ensembled systems for the corresponding sentences; one score per line.
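
Because the sentence and score files are both line-aligned, they can be read together. The following is a minimal sketch, not part of the corpus distribution: the function names `pair_sentences_with_scores` and `load_corpus` are hypothetical, and it assumes the files are UTF-8 encoded and that each line of `ensemble.scr` parses as a decimal number.

```python
from pathlib import Path


def pair_sentences_with_scores(sentences, scores):
    """Pair each sentence with the score found on the same line number.

    Both arguments are iterables of text lines (for example, open files).
    Assumption: every non-blank score line parses as a float.
    """
    sentences = [line.rstrip("\n") for line in sentences]
    scores = [float(line) for line in scores if line.strip()]
    if len(sentences) != len(scores):
        raise ValueError("expected exactly one score per sentence")
    return list(zip(sentences, scores))


def load_corpus(data_dir="data"):
    """Read ensemble.txt and ensemble.scr from data_dir (UTF-8 assumed)."""
    data_dir = Path(data_dir)
    with open(data_dir / "ensemble.txt", encoding="utf-8") as txt, \
         open(data_dir / "ensemble.scr", encoding="utf-8") as scr:
        return pair_sentences_with_scores(txt, scr)
```

Calling `load_corpus()` from the repository root should return a list of `(sentence, score)` tuples, which can then be filtered by score if only high-agreement sentences are wanted. The annotations in `data/ensemble.ann` are not handled here, since they need a brat standoff parser.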

License
