llm_judge_experiments

This repo contains experiments that use LLMs as judges (the LLM-as-a-judge approach). The following experiments have been run:

context relevance

The repository is structured as follows:

data: contains datasets used to evaluate experiments
notebooks: contains Jupyter notebooks used to create and evaluate experiments

The LLMs are invoked through Haystack v2. Wherever API tokens or credentials are required, the notebooks contain INSERT_TOKEN_HERE or similar placeholders that must be replaced with your own values before running.
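As a rough illustration of the workflow described above, the following sketch shows how a context relevance judge could be scored against a labeled dataset and how the INSERT_TOKEN_HERE placeholder could be guarded against. It is not taken from the notebooks: the prompt text, the row fields (question, context, relevant), the LLM_API_TOKEN environment variable, and all function names are assumptions made for illustration, and a stub function stands in for the actual Haystack v2 call.

```python
import os
from typing import Callable

PLACEHOLDER = "INSERT_TOKEN_HERE"  # placeholder string mentioned in this README

# Hypothetical prompt; the notebooks may word this differently.
PROMPT_TEMPLATE = (
    "Question: {question}\n"
    "Context: {context}\n"
    "Is the context relevant to the question? Answer YES or NO."
)


def resolve_token(notebook_value: str, env_var: str = "LLM_API_TOKEN") -> str:
    """Return a usable token, refusing to run with the placeholder left in place."""
    if notebook_value != PLACEHOLDER:
        return notebook_value
    token = os.environ.get(env_var)  # env var name is an assumption
    if not token:
        raise ValueError(f"Replace {PLACEHOLDER} or set {env_var}")
    return token


def parse_verdict(reply: str) -> bool:
    """Map a free-text judge reply to a boolean relevance label."""
    return reply.strip().upper().startswith("YES")


def judge_agreement(rows: list, judge: Callable[[str], str]) -> float:
    """Fraction of rows where the judge's verdict matches the human label."""
    if not rows:
        raise ValueError("rows must not be empty")
    hits = 0
    for row in rows:
        prompt = PROMPT_TEMPLATE.format(
            question=row["question"], context=row["context"]
        )
        hits += parse_verdict(judge(prompt)) == row["relevant"]
    return hits / len(rows)


def stub_judge(prompt: str) -> str:
    """Stand-in for a real LLM call; answers YES only if 'Paris' appears."""
    return "YES" if "Paris" in prompt else "NO"


# Tiny made-up dataset, only to exercise the functions above.
rows = [
    {"question": "What is the capital of France?",
     "context": "Paris is the capital of France.", "relevant": True},
    {"question": "What is the capital of France?",
     "context": "Bananas are rich in potassium.", "relevant": False},
]
agreement = judge_agreement(rows, stub_judge)
```

In a real run, stub_judge would be replaced by a function that sends the prompt to the model through Haystack v2 and returns the reply text.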

deepset-ai/llm_judge_experiments

Folders and files

Latest commit

History

Repository files navigation

llm_judge_experiments

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages