---
title: "ResearchGPT: AI Research Assistant"
emoji: π
colorFrom: blue
colorTo: purple
sdk: streamlit
sdk_version: 1.39.0
app_file: app.py
pinned: false
license: mit
---
This application is a Retrieval-Augmented Generation (RAG) system that lets users upload documents or provide links to them (including arXiv papers) and then ask questions about their content. It supports both OpenAI API models and open-source LLMs such as Mistral-7B-Instruct, giving users flexibility in choosing their preferred model.

Key features:
- Upload PDF documents
- Process document links, including arXiv papers
- Summarize uploaded documents
- Ask questions about uploaded documents
- Flexible Model Selection:
  - OpenAI GPT Models: high-performance option using OpenAI's API
  - Open Source LLMs: cost-effective alternative using Mistral-7B-Instruct
- Dynamic model switching without losing context
- Cached responses for better performance
- Google Search Integration: both models can perform web searches for recent or external information (a minimal search-tool sketch follows this list)
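The search integration could be wired up along these lines. This is a hedged sketch, not the repo's actual tool code, assuming langchain's `GoogleSearchAPIWrapper` (from `langchain_google_community`; older langchain versions expose it under `langchain.utilities`). It reads `GOOGLE_API_KEY` and `GOOGLE_CSE_ID` from the environment, matching the `.env` setup described below:

```python
# Hedged sketch of a web-search helper; the app's actual wiring may differ.
# GoogleSearchAPIWrapper reads GOOGLE_API_KEY and GOOGLE_CSE_ID from the env.
from langchain_google_community import GoogleSearchAPIWrapper

search = GoogleSearchAPIWrapper()

def web_search(query: str, num_results: int = 3) -> str:
    """Join the top result snippets into one context string for the LLM."""
    results = search.results(query, num_results)
    return "\n".join(r.get("snippet", "") for r in results)

print(web_search("retrieval-augmented generation survey arXiv"))
```

The project builds on the following stack: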
- FastAPI: Backend API framework
- Streamlit: Frontend user interface
- LlamaIndex: Core RAG implementation and query engines (how LlamaIndex and FAISS fit together is sketched after this list)
- Langchain: Document processing and model integrations
- FAISS: Vector database for efficient similarity search
- OpenAI API: For text embeddings and question answering
- Hugging Face Hub: For accessing open-source LLMs
- Poetry: For dependency management
- RAGAS: For automated RAG pipeline evaluation
- Pytest: For testing framework
- GitHub Actions: For CI/CD and automated evaluation
- PyMuPDF: For PDF processing
- Sentence Transformers: For text embeddings and reranking
- NLTK: For text processing and tokenization
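A minimal sketch of the core indexing and query flow referenced above, assuming llama-index with its FAISS integration (`llama-index-vector-stores-faiss`) and the default OpenAI embeddings; the `data/` directory and the embedding dimension are assumptions, and the app's actual pipeline may differ:

```python
# Minimal indexing/query sketch: LlamaIndex over a FAISS vector store.
# Requires OPENAI_API_KEY for the default OpenAI embedding model.
import faiss
from llama_index.core import SimpleDirectoryReader, StorageContext, VectorStoreIndex
from llama_index.vector_stores.faiss import FaissVectorStore

documents = SimpleDirectoryReader("data").load_data()  # e.g. uploaded PDFs

embed_dim = 1536  # dimension of OpenAI's text-embedding-ada-002 vectors
vector_store = FaissVectorStore(faiss_index=faiss.IndexFlatL2(embed_dim))
storage_context = StorageContext.from_defaults(vector_store=vector_store)

index = VectorStoreIndex.from_documents(documents, storage_context=storage_context)
query_engine = index.as_query_engine()
print(query_engine.query("What problem does this paper address?"))
```

Two model providers are supported out of the box: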
OpenAI provider:
- Default model: GPT-3.5-turbo
- Suitable for: Production environments requiring high accuracy
- Requires: OpenAI API key
Open-source provider:
- Default model: Mistral-7B-Instruct-v0.3
- Suitable for: Development, testing, or cost-sensitive deployments
- Requires: Hugging Face API key
- Advantages: no usage costs and full control over the model (a provider-selection sketch follows this list)
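Provider switching can be expressed as a small factory. This is a hedged sketch assuming llama-index's `OpenAI` and `HuggingFaceInferenceAPI` wrappers; the app's actual `model_config.py` may structure this differently:

```python
# Hedged sketch of dynamic provider selection; not the app's exact code.
import os
from llama_index.llms.openai import OpenAI
from llama_index.llms.huggingface_api import HuggingFaceInferenceAPI

def get_llm(provider: str):
    """Return an LLM client for 'openai' or 'local' (Hugging Face-hosted)."""
    if provider == "openai":
        return OpenAI(model="gpt-3.5-turbo")  # needs OPENAI_API_KEY
    return HuggingFaceInferenceAPI(
        model_name="mistralai/Mistral-7B-Instruct-v0.3",
        token=os.environ["HF_API_KEY"],  # Hugging Face Inference API key
    )
```

The RAG pipeline is evaluated continuously: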
- Automated evaluation pipeline using GitHub Actions
- RAGAS metrics for comprehensive assessment
- Continuous evaluation on pull requests
- Detailed performance tracking and reporting
- Faithfulness Score: Measures how accurately the generated answers reflect the source content
- Answer Relevancy: Evaluates the semantic relevance of answers to questions
- Context Precision: Assesses the accuracy of retrieved context
- Context Recall: Measures the completeness of retrieved relevant information (a sample RAGAS invocation is sketched below)
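These four metrics can be computed with RAGAS along the following lines. This is a hedged sketch over a tiny hand-made placeholder sample; the repo's `ragas_evaluator.py` and the CI pipeline may structure the run differently:

```python
# Hedged sketch of a RAGAS evaluation over placeholder data.
from datasets import Dataset
from ragas import evaluate
from ragas.metrics import (
    answer_relevancy,
    context_precision,
    context_recall,
    faithfulness,
)

eval_data = Dataset.from_dict({
    "question": ["Which dataset does the paper evaluate on?"],
    "answer": ["The paper evaluates on SQuAD."],              # generated answer
    "contexts": [["...retrieved chunks for this question..."]],  # placeholder
    "ground_truth": ["SQuAD"],                                # reference answer
})

scores = evaluate(
    eval_data,
    metrics=[faithfulness, answer_relevancy, context_precision, context_recall],
)
print(scores)
```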
Current evaluation results for different RAG configurations:
Experiment | Faithfulness | Answer Relevancy | Context Precision | Context Recall |
---|---|---|---|---|
Classic VDB + Naive RAG | [1.0, 1.0, 1.0, 1.0] | [0.9806, 0.9933, 0.9985, 0.9718] | [1.0, 0.83, 1.0, 0.8] | [1.0, 1.0, 0.6666, 1.0] |
Classic VDB + LLM Rerank | [1.0, 1.0, 1.0, 1.0] | [0.9806, 0.9901, 0.9933, 0.9713] | [1.0, 0.67, 0.67, 0.8] | [1.0, 1.0, 0.6666, 1.0] |
- Classic VDB + Naive RAG
  - Basic vector database retrieval with top-k=3
  - Direct question answering with compact response mode
  - Optimized for speed and simplicity
  - Shows consistently high faithfulness and answer relevancy
- Classic VDB + LLM Rerank
  - Enhanced retrieval with top-k=5
  - Uses a cross-encoder model (ms-marco-MiniLM-L-12-v2) for reranking
  - Improved precision through semantic reranking
  - Maintains high faithfulness while slightly trading off context precision
- MMR (Maximal Marginal Relevance)
  - Balances relevance and diversity
  - Configurable with mmr_threshold=0.7
- Advanced combinations
  - Sentence Window Retrieval
  - Multi-Query Expansion
  - HyDE + Rerank
  - Window + HyDE

The first three configurations, plus HyDE + Rerank, are sketched in code below.
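Hedged sketches of those configurations, expressed with LlamaIndex against the `index` built in the earlier FAISS example; the repo's `rag_pipeline.py` may wire these differently, and `top_n=3` for the reranker is an assumption:

```python
# Retrieval configurations sketched with LlamaIndex; `index` comes from the
# earlier FAISS example. Not the app's exact code.
from llama_index.core.indices.query.query_transform import HyDEQueryTransform
from llama_index.core.postprocessor import SentenceTransformerRerank
from llama_index.core.query_engine import TransformQueryEngine

# Classic VDB + Naive RAG: top-k=3 retrieval, compact response synthesis.
naive_qe = index.as_query_engine(similarity_top_k=3, response_mode="compact")

# Classic VDB + LLM Rerank: retrieve top-k=5, then rerank with a cross-encoder.
reranker = SentenceTransformerRerank(
    model="cross-encoder/ms-marco-MiniLM-L-12-v2", top_n=3  # top_n is an assumption
)
rerank_qe = index.as_query_engine(similarity_top_k=5, node_postprocessors=[reranker])

# MMR: trade relevance against diversity among retrieved chunks.
mmr_qe = index.as_query_engine(
    vector_store_query_mode="mmr", vector_store_kwargs={"mmr_threshold": 0.7}
)

# HyDE + Rerank: generate a hypothetical answer document, embed it for retrieval.
hyde_qe = TransformQueryEngine(rerank_qe, query_transform=HyDEQueryTransform())
```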
Our GitHub Actions workflow automatically:
- Runs on pull requests to the main branch
- Executes comprehensive RAG evaluations
- Generates CSV reports with detailed metrics
- Posts results as PR comments
- Archives evaluation artifacts
- Clone the repository:

  ```bash
  git clone https://github.com/yourusername/rag-based-qa-app.git
  cd rag-based-qa-app
  ```
- Install dependencies using Poetry:

  ```bash
  poetry install
  ```
- Set up environment variables by creating a `.env` file in the root directory (loaded at application startup; a sketch follows these steps):

  ```bash
  OPENAI_API_KEY=your_openai_api_key_here     # Required for OpenAI models
  HF_API_KEY=your_huggingface_api_key_here    # Required for Mistral and other open-source models
  GOOGLE_API_KEY=your_google_api_key_here     # Required for Google Search
  GOOGLE_CSE_ID=your_google_cse_id_here       # Required for Google Search
  ```
- Run the application:

  Option 1, run both services together:

  ```bash
  poetry run start
  ```

  Option 2, run the services separately:

  ```bash
  # Terminal 1 - backend
  poetry run python main.py

  # Terminal 2 - frontend
  poetry run streamlit run app.py
  ```
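On the `.env` step above: a minimal sketch of loading those values at startup, assuming python-dotenv is used (the repo may read its settings differently):

```python
# Assumption: settings are loaded from .env with python-dotenv at startup.
import os
from dotenv import load_dotenv

load_dotenv()  # reads key=value pairs from .env in the working directory
if not (os.getenv("OPENAI_API_KEY") or os.getenv("HF_API_KEY")):
    raise RuntimeError("Set at least one model-provider key in .env")
```

Once both services are running, the typical flow is: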
- Upload a document or paste a document link (including arXiv links).
- Select your preferred model provider (OpenAI or Local LLM) from the sidebar.
- Wait for the document to be processed.
- Click on "Summarize Document" to get an overview of the uploaded document.
- Ask questions about the document in the provided text input.
- View the AI-generated answers based on the document's content.
Both OpenAI and Mistral models can perform Google searches to find recent or external information when needed. This feature enhances the models' ability to provide up-to-date and comprehensive answers.
Choose the OpenAI provider when you:
- Need the highest accuracy and performance
- Work with complex academic papers
- Require production-grade responses
- Have budget for API usage
Choose the open-source (local) provider for:
- Development and testing
- Cost-sensitive operations
- Deployments with privacy concerns about external APIs
- Offline or air-gapped use
- Basic summarization and Q&A workloads, for which it is sufficient
```
researcher/
├── core/
│   ├── config/
│   │   └── model_config.py        # Model configurations
│   ├── routers/
│   │   └── document_routes.py     # API endpoints
│   └── utils/
│       ├── vector_store.py        # FAISS operations
│       ├── rag_pipeline.py        # RAG implementation
│       └── text_processing.py     # Text processing
├── testing/
│   └── data_preparation.py        # Test data generation
└── tests/
    ├── evaluation/
    │   └── ragas_evaluator.py     # RAGAS evaluation
    ├── config/
    │   └── test_config.py         # Test configurations
    └── test_evaluation/
        └── test_experiments.py    # Evaluation tests
```
- Build the Docker image:

  ```bash
  docker build -t rag-qa-app .
  ```
- Run the Docker container:

  ```bash
  docker run -p 8000:8000 -p 8501:8501 --env-file .env rag-qa-app
  ```
- Access the application:
  - FastAPI backend: http://localhost:8000
  - Streamlit frontend: http://localhost:8501
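A quick way to smoke-test the backend once it is up. The real route names live in `core/routers/document_routes.py` and are not reproduced here, so the `/ask` path and payload shape below are hypothetical:

```python
# Hypothetical smoke test: "/ask" and the payload shape are assumptions,
# not the app's documented API.
import requests

resp = requests.post(
    "http://localhost:8000/ask",  # hypothetical endpoint
    json={"question": "What is the paper's main contribution?"},
    timeout=60,
)
resp.raise_for_status()
print(resp.json())
```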
Contributions are welcome! Please feel free to submit a Pull Request. Areas of particular interest include:
- Adding support for additional open-source models
- Improving model response caching
- Enhancing the RAG pipeline
- UI/UX improvements
- Adding new RAG configurations for evaluation
- Improving evaluation metrics and benchmarks
- Enhancing the RAGAS evaluation pipeline
- Optimizing retrieval strategies based on evaluation results
This project is licensed under the MIT License.