🚀 RAG Framework for Scalable Document Ingestion and Retrieval

Overview

Welcome to the RAG Framework, a cutting-edge solution designed for scalable document ingestion and retrieval. Leveraging distributed computing with Ray, this framework empowers users to seamlessly process vast amounts of documents in parallel across multiple CPU and GPU nodes. The inclusion of Qdrant disk-based indexing ensures support for the scale of billions of vectors, making it a robust choice for large-scale applications.

Demo

product.demo.mp4

Key Features

1. Distributed Computing with Ray 🌐

The RAG Framework employs Ray for distributed computing, enabling parallel document ingestion across multiple CPU and GPU nodes. This ensures optimal utilization of resources for efficient and scalable processing.

2. Qdrant Disk-Based Indexing 💽

To support the scale of billions of vectors, the framework integrates Qdrant disk-based indexing. This technology provides high-performance indexing capabilities, facilitating rapid and precise retrieval of relevant information.

3. REST APIs for Seamless Integration 🔄

RAG Framework offers REST APIs for convenient asset ingestion from popular sources such as S3 and GitHub. The APIs are also designed for efficient retrieval, ensuring a smooth and seamless integration into your existing workflows.

4. Ray Serve for API Scalability ⚙️

REST APIs are served using Ray Serve, allowing for easy scalability across multiple GPU and CPU nodes. This ensures that the framework adapts to the demands of your application, providing consistent performance even in dynamic environments.

5. Configurability at Your Fingertips 🛠️

The RAG Framework is highly configurable, allowing users to tailor the system to their specific needs. Key configuration options include the number of CPUs/GPUs to use, the choice of embedding model, chunk size, reranker model, and more.

Getting Started using docker

Follow these steps to get started with the RAG Framework:

Clone the repository
Configure your settings: Edit the configuration file (.env) to customize the framework based on your requirements. The sample .env is given in .env.example
Run using docker: docker compose up

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
api		api
jobs		jobs
schema		schema
utils		utils
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
constants.py		constants.py
docker-compose.yml		docker-compose.yml
main.py		main.py
requirements.txt		requirements.txt
settings.py		settings.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🚀 RAG Framework for Scalable Document Ingestion and Retrieval

Overview

Demo

Key Features

1. Distributed Computing with Ray 🌐

2. Qdrant Disk-Based Indexing 💽

3. REST APIs for Seamless Integration 🔄

4. Ray Serve for API Scalability ⚙️

5. Configurability at Your Fingertips 🛠️

Getting Started using docker

About

Releases

Packages

Languages

License

kautshukla/ragswift

Folders and files

Latest commit

History

Repository files navigation

🚀 RAG Framework for Scalable Document Ingestion and Retrieval

Overview

Demo

Key Features

1. Distributed Computing with Ray 🌐

2. Qdrant Disk-Based Indexing 💽

3. REST APIs for Seamless Integration 🔄

4. Ray Serve for API Scalability ⚙️

5. Configurability at Your Fingertips 🛠️

Getting Started using docker

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages