Evaluation of Retrieval with GermanDPR #2779
-
Thear all, (1) How can I obtain the corpus you used? I.e. what specific Wikipedia dump was used in combination with GenSim's WikiCorpus(it will be nice if i can get the specific version of the corpus maybe as link)? I look forward to your feedback and thank you in advance. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 3 replies
-
Hi @Zaker237 |
Beta Was this translation helpful? Give feedback.
-
Thank for the answer and the script. it really help me to understand the evaluation Process. but still now i'm facing some other problem. |
Beta Was this translation helpful? Give feedback.
Hi @Zaker237
Download links for GermanQuAD and GermanDPR are at the bottom of this page, including the test set: https://deepset.ai/germanquad The Wikipedia dump is from June 2019. The latest dump can be found in xml format here: https://dumps.wikimedia.org/dewiki/latest/ Older dumps you would need to find from the source yourself but I think they are also available. Maybe somebody at Wikimedia can help you.