Fair Entity Linking Benchmarks

This repository contains the entity linking benchmarks Wiki-Fair and News-Fair introduced in the EMNLP 2023 paper "A Fair and In-Depth Evaluation of Existing End-to-End Entity Linking Systems".

The benchmarks aim to provide a basis for a fairer comparison between different entity linkers. This is achieved, e.g., by including non-named entities in the ground truth, providing alternative mentions in cases where it is unclear what the best ground truth annotation is, and using optional mentions for entities such as datetimes and quantities.

The annotation guidelines that were used to annotate the benchmarks can be found at https://github.com/ad-freiburg/entity-linking-annotation-guidelines

Description

Wiki-Fair contains 80 random Wikipedia articles from a Wikipedia dump from 2020. News-Fair contains 40 random news articles from a news crawl. In each article, 3 consecutive paragraphs were manually annotated. The remainder of each article is left unannotated (to cover a large variety of topics with an acceptable amount of manual annotation work) but kept in the benchmark to provide context for entity linkers.

Unlike most entity linking benchmarks, the benchmarks contain not only named entities but also non-named entities. A type whitelist determined which kinds of entities were annotated. To ensure a fair comparison of different linkers, the benchmarks include alternative ground truth mentions in cases where it is unclear what the best ground truth annotation is. The benchmarks also contain optional mentions, e.g., for datetimes and quantities. Coreferences are annotated, too, but a version of each benchmark without coreferences is also provided. A comprehensive list of annotation guidelines can be found in the annotation guidelines repository (https://github.com/ad-freiburg/entity-linking-annotation-guidelines).
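To illustrate how alternative and optional mentions can affect scoring, here is a minimal sketch of one plausible counting scheme. It is not ELEVANT's implementation: the `GoldItem` class, the `score` function, the `(start, end, entity_id)` tuple layout, and the entity IDs are all hypothetical and chosen only for illustration. The sketch assumes that a prediction matching any alternative of a ground truth item counts as correct, and that optional items are neither rewarded when found nor penalized when missed.

```python
from dataclasses import dataclass

# Hypothetical mention representation: (start offset, end offset, entity ID).
Span = tuple[int, int, str]


@dataclass
class GoldItem:
    """One ground truth item with all of its acceptable annotations."""
    alternatives: set[Span]
    optional: bool = False


def score(gold: list[GoldItem], predicted: set[Span]) -> dict[str, int]:
    """Count true positives, false positives and false negatives."""
    tp = fn = 0
    matched: set[Span] = set()
    for item in gold:
        hits = item.alternatives & predicted
        matched |= hits
        if item.optional:
            # Assumption: optional items never count as TP or FN.
            continue
        if hits:
            tp += 1
        else:
            fn += 1
    # Predictions that match no ground truth alternative are false positives.
    fp = len(predicted - matched)
    return {"tp": tp, "fp": fp, "fn": fn}


gold = [
    GoldItem({(0, 6, "E1")}),
    # Two acceptable annotations for the same piece of text.
    GoldItem({(10, 25, "E2"), (19, 25, "E3")}),
    # An optional mention, e.g. a date.
    GoldItem({(30, 34, "E4")}, optional=True),
]
predicted = {(0, 6, "E1"), (19, 25, "E3"), (30, 34, "E4"), (40, 45, "E5")}
result = score(gold, predicted)
print(result)
```

In this example, the linker is credited for choosing either alternative of the second item, and its prediction of the optional date changes neither the true positive nor the false positive count. Only the prediction at offsets 40 to 45, which matches nothing in the ground truth, counts as a false positive.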

Usage

The benchmarks are best used within the ELEVANT entity linking evaluation tool, since ELEVANT automatically handles special features of these benchmarks such as alternative mentions, optional mentions, and the evaluation of only the annotated part of the articles. Wiki-Fair and News-Fair are included in ELEVANT by default, so linking results on these benchmarks can be evaluated with ease. Please refer to the ELEVANT GitHub repository for instructions on how to work with ELEVANT.
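For a quick look at a benchmark file outside of ELEVANT, the following sketch reads a `.jsonl` file into a list of Python dictionaries. It assumes only that the files follow the usual JSON Lines convention of one JSON object per line, presumably one per article. The function name `load_articles` is hypothetical, and no field names are assumed, since the schema is not described here.

```python
import json
from pathlib import Path


def load_articles(path):
    """Read a JSON Lines file and return one dict per non-empty line."""
    articles = []
    with Path(path).open(encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:  # skip blank lines
                articles.append(json.loads(line))
    return articles
```

If each line does hold one article, `len(load_articles("wiki-fair.benchmark.jsonl"))` should give 80 and the News-Fair file should give 40, matching the article counts stated in the description. Calling `.keys()` on the first entry is a simple way to discover the available fields.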

Files in this repository:

README.md
news-fair-no-coref.benchmark.jsonl
news-fair-no-coref.metadata.json
news-fair.benchmark.jsonl
news-fair.metadata.json
wiki-fair-no-coref.benchmark.jsonl
wiki-fair-no-coref.metadata.json
wiki-fair.benchmark.jsonl
wiki-fair.metadata.json
