Reading and validating inputs #2

JMMackenzie · 2024-11-22T06:11:59Z

Extend the code in utils.py to include validation for inputs, and potentially handling containers of inputs (mapping a TREC run file to either a list of lists, or a dictionary of lists, etc).

Validation needs to ensure that:

Document ranks are obeyed -- ranks are assumed to be strictly increasing, but gaps are allowed for representing tied elements. For example, having a sequence of ranks like [1, 1, 1, 4, 5] should be valid,
Scores are used only as a diagnostic against provided ranks. If the ranks and scores do not agree, either warn or error.

The text was updated successfully, but these errors were encountered:

JMMackenzie · 2024-11-28T22:58:48Z

Input file types need to be coerced into the appropriate type given the metric of choice.

Rankings

If a qrel file is to be treated as a ranking, we can format it into an RBRanking with the highest grade as the first (tied) group, and so on.
A trec run is naturally a ranking.

Sets

A qrel file is naturally a set in some sense, but we will convert to positive/negative sets according to some cutoff.
A trec run is treated as all positive; things not in the run are negative implicitly.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reading and validating inputs #2

Reading and validating inputs #2

JMMackenzie commented Nov 22, 2024

JMMackenzie commented Nov 28, 2024

Reading and validating inputs #2

Reading and validating inputs #2

Comments

JMMackenzie commented Nov 22, 2024

JMMackenzie commented Nov 28, 2024