
[FT] Adding caching for each dataset run #417

Open
JoelNiklaus opened this issue Dec 2, 2024 · 2 comments
Labels
feature request New feature/request

Comments

@JoelNiklaus
Contributor

Issue encountered

When running large evals with many dataset configurations, it is very painful to rerun everything if something fails.

Solution/Feature

It would be great if intermediate results could be cached, for example the computed metrics of each dataset.

@JoelNiklaus JoelNiklaus added the feature request New feature/request label Dec 2, 2024
@punitvara

punitvara commented Dec 11, 2024

I am looking to contribute to the HF repo. I am trying to understand the code base and see if I can add this feature.

@clefourrier
Member

@punitvara We would want results to be saved after each batch run, and to be reused if an evaluation is launched with the exact same parameter configuration.
TBH, it's not a trivial PR to work on - if you're unfamiliar with lighteval, I would suggest working on #324, #325, or maybe #355 to get to know the code base first
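A minimal sketch of the idea described above, assuming a file-based cache keyed by a hash of the exact parameter configuration (the function names and cache layout here are hypothetical, not lighteval's actual API):

```python
import hashlib
import json
from pathlib import Path

# Hypothetical sketch: persist computed metrics after each run, keyed by a
# stable hash of the parameter configuration, so that relaunching an
# evaluation with the exact same configuration reuses the saved results.

CACHE_DIR = Path("eval_cache")

def config_key(config: dict) -> str:
    # Canonical JSON (sorted keys) gives the same hash for the same parameters.
    canonical = json.dumps(config, sort_keys=True)
    return hashlib.sha256(canonical.encode()).hexdigest()

def run_with_cache(config: dict, compute_metrics) -> dict:
    CACHE_DIR.mkdir(exist_ok=True)
    cache_file = CACHE_DIR / f"{config_key(config)}.json"
    if cache_file.exists():
        # Same configuration was evaluated before: reuse the stored metrics.
        return json.loads(cache_file.read_text())
    metrics = compute_metrics(config)        # the expensive evaluation step
    cache_file.write_text(json.dumps(metrics))  # save results after this run
    return metrics
```

On a failure partway through a multi-dataset run, only the configurations without a cache entry would need to be recomputed on the next launch.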
