Skip to content

Commit

Permalink
Updated readme
Browse files Browse the repository at this point in the history
  • Loading branch information
matthewcoole committed Oct 17, 2024
1 parent 419e83f commit d119b00
Showing 1 changed file with 73 additions and 4 deletions.
77 changes: 73 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,13 +1,82 @@
# llm-eval
Scripts and data for LLM evaluation.
This repository contains a reproducible workflow setup using [DVC](https://dvc.org/) backed by a [JASMIN object store](https://help.jasmin.ac.uk/docs/short-term-project-storage/using-the-jasmin-object-store/). Before working with the repository please contact [Matt Coole](mailto:[email protected]) to request access to the Jasmin object store `llm-eval-o`. Then follow the instructions below.

This repository is setup to work with [DVC](https://dvc.org/) backed by a [JASMIN object store](https://help.jasmin.ac.uk/docs/short-term-project-storage/using-the-jasmin-object-store/). Please follow the instruction in [`dvc.md`](dvc.md) to get up and running.
## Requirements
- [Ollama](https://ollama.com/download) ([`llama3.1`](https://ollama.com/library/llama3.1) and [`mistral-nemo`](https://ollama.com/library/mistral-nemo) models)

## DVC and CML
## Getting started
First create a new virtual environment and install the required dependencies:
```shell
python -m venv .venv
source .venv/bin/activate
pip install .
```
Next setup your local DVC configuration with your [Jasmin object store access key](https://help.jasmin.ac.uk/docs/short-term-project-storage/using-the-jasmin-object-store/#creating-an-access-key-and-secret):
```shell
dvc remote modify --local jasmin access_key_id '<ACCES_KEY_ID>'
dvc remote modify --local jasmin secret_access_key '<KEY_SECRET>'
```
Pull the data from the object store using DVC:
```shell
dvc pull
```
You should now be ready to re-run the pipeline:
```shell
dvc repro
```
This pipeline is defined in [`dvc.yaml`](dvc.yaml) and can be viewed with the command:
```shell
dvc dag
```
```
+----------------+
| fetch-metadata |
+----------------+
*
*
*
+------------------+ +-----------------------+
| extract-metadata | | fetch-supporting-docs |
+------------------+ +-----------------------+
** **
*** ***
** **
+------------+
| chunk-data |
+------------+
*
*
*
+-------------------+
| create-embeddings |
+-------------------+
*
*
*
+------------------+ +--------------------+
| generate-testset | | upload-to-docstore |
+------------------+ +--------------------+
** **
*** ***
** **
+------------------+
| run-rag-pipeline |
+------------------+
*
*
*
+----------+
| evaluate |
+----------+
```

## Notes

### DVC and CML
Notes on the use of Data Version Control and Continuous Machine Learning:
- [DVC](dvc.md)
- [CML](cml.md)

## vLLM
### vLLM
Notes on running models with vLLM:
- [vLLM](vllm.md)

1 comment on commit d119b00

@github-actions
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

context_precision: 0.48629908751134027
answer_correctness: 0.4976344431783619
answer_relevancy: 0.5595659289903719
context_recall: 0.4645794577234442

Please sign in to comment.