A small comparison of a few RAG evaluation frameworks, demonstrated in Jupyter notebooks.
Tested with Python 3.11.6. To set up a virtual environment and install the dependencies:
python -m venv .venv
source .venv/bin/activate
python -m pip install -r requirements.txt
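Before opening the notebooks, it can help to confirm that the virtual environment is active and that the interpreter is close to the tested version. The following is a minimal sketch, not part of the repository: the helper names `in_virtualenv` and `matches_tested_version` are hypothetical, and comparing only the major and minor version (3.11) is an assumption.

```python
import sys


def in_virtualenv() -> bool:
    # Inside a venv, sys.prefix points at the environment, while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


def matches_tested_version(version_info=sys.version_info) -> bool:
    # The notebooks were tested with Python 3.11.6; checking only
    # major.minor is an assumption made for this sketch.
    return tuple(version_info[:2]) == (3, 11)


if __name__ == "__main__":
    print("virtual environment active:", in_virtualenv())
    print("Python 3.11.x:", matches_tested_version())
```

Run it with the same `python` that was used to create `.venv`. A different minor version may still work, but it has not been tested.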
The demonstrations are meant to be read in the following order: