AI Custom Chatbot Quickstart · The Basics

Technical Objectives

Ingest data into a vector database
Query the vector database
Query an agent that decides whether to query the vector database

Architecture Overview

Prerequisites

Clone this repository. Or branch it if you want to make your own edits.
Pinecone Vector Database. You can create a free account at Pinecone's website.
Open AI API account. You can sign up at Open AI's website.
- You will need to add a payment method to your account here.
Python (and your favorite IDE). We are using python v3.10.7.
Your favorite API client tool (we used Postman, but you can also use curl)

Set up your environment

Install dependencies: pip install -r requirements.txt
Start the app: python3 main.py
With your favorite API client tool, send a get request to the root endpoint (localhost:8000/)

If you receive the message “Hello World”, you are good to go 🎉

Ingesting Data

We leverage Llama Hub loaders to facilitate our ETL process. These loaders handle the tokenization and embedding for us. These loaders leverage Open AI's embedding model for the translations to a vector.

For more information on how Open AI's embedding model works, here's a good starting point.

Set up Infrastructure

We recommend setting up a Pinecone vector database. Many awesome vector databases exist, but Pinecone is a great starting point. Pinecone is a native vector database which increases the accuracy of search results. The database is managed and provides a dashboard out of the box.

Follow the Prerequisites steps if you haven't already.
Set up your vector database
- Create an index, give it a name.
- The index dimension is 1536. This is the number of output dimensions from Open AI's embedding model *text-embedding-ada-002*. Source
Update environment variables

Create a .env file that contains the following:

OPENAI_API_KEY=<insert OpenAI API key>
PINECONE_API_KEY=<insert Pinecone API Key>

**In the *config.py* file, you will need to update Pinecone information

PINECONE_INDEX=<name of your index>
PINECONE_ENVIRONMENT=<name of your pinecone environment, ex: asia-southeast1-gcp-free>

Run App

This developer kit contains a loader for scraping a website. This is located in *import_service.py*

Start the app: python3 main.py
Send a POST **request to the endpoint /load-website-docs with the following body:

{
  "page_urls": [""]
}

Example:

{
  "page_urls": [
    "https://focusedlabs.io",
    "https://focusedlabs.io/about",
    "https://focusedlabs.io/contact",
    "https://focusedlabs.io/case-studies",
    "https://focusedlabs.io/case-studies/agile-workflow-enabled-btr-automation",
    "https://focusedlabs.io/case-studies/hertz-technology-new-markets",
    "https://focusedlabs.io/case-studies/aperture-agile-transformation",
    "https://focusedlabs.io/case-studies/automated-core-business-functionality"
  ]
}

Outcome: You'll see the vector number increase in your Pinecone dashboard. Yay!!! Now you have data you can query.

Query Data

Search the Database

Starting with the bare minimum. First, we'll make sure we can query the database. This will execute semantic search on the data you've loaded. For more details on what semantic search with Pinecone looks like, start with this article

Start the app: python3 main.py
Send a POST **request to the endpoint /search-database with the following body:

{
  "text": ""
}

Example:

{
  "text": "What solutions did Focused Labs provide for Hertz?"
}

Outcome: You’ll receive an answer from the database.

Query an agent

Ok, you can retrieve data from the database. But what happens when a user asks unrelated questions like "who are you?" We need to add an agent. You can think of agents as the brain behind deciding what tool to use. Sometimes, you need to query the database. Sometimes you don't. The agent decides.

Here is an update to our Architecture Overview diagram showing the agent.

Start the app: python3 main.py
Send a POST **request to the endpoint /ask-agent with the following body:

{
  "text": ""
}

Example:

{
  "text": "Who are you?"
}

Outcome: You’ll receive an answer from the agent.

FAQ

If you run into a Rate Limit Error, you need to make sure your OpenAI account has credit available.
If you run into a PermissionError: [Errno 13] Permission denied: then make sure you are running your app with Python3
If you run into a MaxRetryError...Caused by SSLError when you are uploading your data to the vector database, wait another 5 minutes for your index to fully initialize and try again.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

AI Custom Chatbot Quickstart · The Basics

Table of Contents

Goals

Why?

Technical Objectives

Architecture Overview

Prerequisites

Set up your environment

Ingesting Data

Set up Infrastructure

Run App

Query Data

Search the Database

Query an agent

FAQ

Files

README.md

Latest commit

History

README.md

File metadata and controls

AI Custom Chatbot Quickstart · The Basics

Table of Contents

Goals

Why?

Technical Objectives

Architecture Overview

Prerequisites

Set up your environment

Ingesting Data

Set up Infrastructure

Run App

Query Data

Search the Database

Query an agent

FAQ