Commit

add datasets
jagerliu committed Dec 12, 2019
1 parent d62bcec commit 708f0c4
Showing 20 changed files with 158,107 additions and 0 deletions.
2 changes: 2 additions & 0 deletions README.md
@@ -102,6 +102,7 @@ useage: [--pretrained_model_path] - Path to the pre-trained model parameters.
### Classification benchmarks

Accuracy (dev/test %) on different dataset:

| Dataset | HowNet | CnDbpedia |
| :----- | :----: | :----: |
| Book review | 88.75/87.75 | 88.80/87.69 |
@@ -117,6 +118,7 @@ Accuracy (dev/test %) on different dataset:
### NER example

Run an example on the msra_ner dataset with CnDbpedia:

```
CUDA_VISIBLE_DEVICES='0' nohup python3 -u run_kbert_ner.py \
--pretrained_model_path ./models/google_model.bin \
5 changes: 5 additions & 0 deletions datasets/.gitignore
@@ -0,0 +1,5 @@
finance_qa/
xnli/
weibo/
lcqmc/
law_qa/
10,001 changes: 10,001 additions & 0 deletions datasets/book_review/dev.tsv
10,001 changes: 10,001 additions & 0 deletions datasets/book_review/test.tsv
20,001 changes: 20,001 additions & 0 deletions datasets/book_review/train.tsv
1,201 changes: 1,201 additions & 0 deletions datasets/chnsenticorp/dev.tsv
1,201 changes: 1,201 additions & 0 deletions datasets/chnsenticorp/test.tsv
9,601 changes: 9,601 additions & 0 deletions datasets/chnsenticorp/train.tsv
2,911 changes: 2,911 additions & 0 deletions datasets/financial_ner/dev.tsv
3,067 changes: 3,067 additions & 0 deletions datasets/financial_ner/test.tsv
23,676 changes: 23,676 additions & 0 deletions datasets/financial_ner/train.tsv
940 changes: 940 additions & 0 deletions datasets/medical_ner/dev.tsv
756 changes: 756 additions & 0 deletions datasets/medical_ner/test.tsv
6,920 changes: 6,920 additions & 0 deletions datasets/medical_ner/train.tsv
2,319 changes: 2,319 additions & 0 deletions datasets/msra_ner/dev.tsv
4,637 changes: 4,637 additions & 0 deletions datasets/msra_ner/test.tsv
20,865 changes: 20,865 additions & 0 deletions datasets/msra_ner/train.tsv
10,001 changes: 10,001 additions & 0 deletions datasets/shopping/dev.tsv
10,001 changes: 10,001 additions & 0 deletions datasets/shopping/test.tsv
20,001 changes: 20,001 additions & 0 deletions datasets/shopping/train.tsv

Large diffs are not rendered by default.
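
Since the dataset diffs are not rendered, a quick local check of the added files may help. The sketch below counts the rows in one dataset's splits. It assumes each `.tsv` file starts with a header line (suggested, but not confirmed, by line counts such as 20,001 and 10,001), and `summarize_tsv` is a hypothetical helper name, not part of the repository.

```python
import csv
from pathlib import Path


def summarize_tsv(path, has_header=True):
    """Return (header, row_count) for a tab-separated file.

    Assumption: the first line is a header row; pass has_header=False if not.
    """
    with Path(path).open(encoding="utf-8", newline="") as handle:
        # QUOTE_NONE keeps quote characters inside the text as literal characters.
        reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        header = next(reader, None) if has_header else None
        row_count = sum(1 for _ in reader)
    return header, row_count


if __name__ == "__main__":
    # Paths follow the file list in this commit; adjust them to your checkout.
    for split in ("train", "dev", "test"):
        path = Path("datasets/book_review") / f"{split}.tsv"
        if path.exists():
            header, rows = summarize_tsv(path)
            print(f"{path}: columns={header}, rows={rows}")
```

If the header assumption holds, the reported row count for each file should be one less than the number of added lines listed for that file.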
