Graphlet AI - Chatbot Class

This course covers generative AI, large language models (LLMs), vector search, retrieval aided generation (RAG), LLM fine-tuning, and more. The course is designed for software engineers, data scientists, and machine learning engineers who want to learn how to build AI-powered chatbots.

Shout Outs to Ivan Reznikov

This course owes great debt to the LangChain 101 course by Ivan Reznikov. I started with it and other content I created and combined them into a single course covering the topics below.

Course Essentials

Skill Prerequisites

Basic knowledge of Python
Basic CS and math skills

Course Outline

Introduction to Generative AI
Introduction to Large Language Models (LLMs)
Introduction to Vector Search
Introduction to Retrieval Aided Generation (RAG)
Introduction to LLM Fine-Tuning

Course Projects

Build a generative chatbot
Build a retrieval-based chatbot
Build a generative chatbot with retrieval-aided generation (RAG)
Build a generative chatbot with LLM fine-tuning

Course Materials

Course Slides
Course Code Repository (this repository)

Code Environment Setup

I provide a Docker image for this course that uses Jupyter Notebooks. Docker allows you to run the class's code in an environment precisely matching the one in which the code was developed and tested. You can also use the Docker image to run the course code in VSCode or another editor (see below).

In addition to Docker you can also setup an environment locally using the instructions below.

Install Docker

Install docker and then check the Get Started page if you aren't familiar.

There are several docker containers used in this course:

jupyter: Jupyter Notebook server where we will interactively write and run code.
neo4j: Neo4j graph database server where we will store and query graph data for prompt engineering and fine-tuning LLMs.
opensearch: OpenSearch server where we will store and query documents for RAG.

Docker Compose

Bring up the course environment with the following command:

docker compose up -d

Find the Jupyter Notebook url via this command:

docker logs jupyter -f --tail 100

Look for the url with 127.0.0.1 and open it. You should see the Jupyter Notebook home page.

NOTE: Insert an image of Jupyter home page for this course.

Docker and VSCode

NOTE: add instructions.

Code-Level Environment Setup

We use a Docker image to run the course, but you can also setup the environment so the code will work in VSCode or another editor. We provide a development tools setup using black, flake8, isort, mypy and pre-commit for you to modify and use as you see fit.

Install Anaconda Python

We use Anaconda Python, Python version 3.10.0, for this course. You can download Anaconda Python from here. Once you have installed Anaconda Python, you can create a new environment for this course by running the following command:

conda create -n chatbot-class python=3.10 -y

When you create a new environment or start a new shell, you will need to activate the chatbot-class conda environment with the following command:

conda activate chatbot-class

Now you are running Python 3.10 in the chatbot-class environment. To use this Python in VSCode, hit SHIFT-CMD-P (on Mac) and select Python: Select Interpreter. Then select the chatbot-class environment's Python.

To deactivate this environment, run:

conda deactivate

Other Virtual Environments

Note: I don't support other environments, but you can actually use any Python 3.10 if you are smart enough to make that work. :) You will need to manage your own virtual environments. Python 3's venv are easy to use.

To create a venv for the project, run:

python3 -m venv chatbot-class

To activate this venv run:

source chatbot-class/bin/activate

To deactivate this environment, run:

deactivate

Install Poetry for Dependency Management

We use Poetry for dependency management, as it makes things fairly painless.

Verify Poetry installation instructions here so you know the URL https://install.python-poetry.org is legitimate to execute in python3.

Then install Poetry with the following command:

curl -sSL https://install.python-poetry.org | python3 -

It is less "clean" in terms of environmental isolation, but alternatively you can install poetry via pip:

pip install poetry

Install Dependencies via Poetry

poetry install

Essential Tools

`langchain` (docs)

LangChain is a framework for developing applications powered by language models. It enables applications that:

Are context-aware: connect a language model to sources of context (prompt instructions, few shot examples, content to ground its response in, etc.)

Reason: rely on a language model to reason (about how to answer based on provided context, what actions to take, etc.)

The main value props of LangChain are:

Components: abstractions for working with language models, along with a collection of implementations for each abstraction. Components are modular and easy-to-use, whether you are using the rest of the LangChain framework or not

Off-the-shelf chains: a structured assembly of components for accomplishing specific higher-level tasks Off-the-shelf chains make it easy to get started. For complex applications, components make it easy to customize existing chains and build new ones.

`langchain-hub`

Taking inspiration from Hugging Face Hub, LangChainHub is collection of all artifacts useful for working with LangChain primitives such as prompts, chains and agents. The goal of this repository is to be a central resource for sharing and discovering high quality prompts, chains and agents that combine together to form complex LLM applications.

See example usage here.

`llama-index`

LlamaIndex is a "data framework" to help you build LLM apps. It provides the following tools:

Offers data connectors to ingest your existing data sources and data formats (APIs, PDFs, docs, SQL, etc.)

Provides ways to structure your data (indices, graphs) so that this data can be easily used with LLMs.

Provides an advanced retrieval/query interface over your data: Feed in any LLM input prompt, get back retrieved context and knowledge-augmented output.

Allows easy integrations with your outer application framework (e.g. with LangChain, Flask, Docker, ChatGPT, anything else).

LlamaIndex provides tools for both beginner users and advanced users. Our high-level API allows beginner users to use LlamaIndex to ingest and query their data in 5 lines of code. Our lower-level APIs allow advanced users to customize and extend any module (data connectors, indices, retrievers, query engines, reranking modules), to fit their needs.

Name		Name	Last commit message	Last commit date
Latest commit History 135 Commits
Lecture 1. Introduction to Langchain		Lecture 1. Introduction to Langchain
Lecture 2. Models		Lecture 2. Models
Part 1. Langchain		Part 1. Langchain
Part 2. Building Chatbots		Part 2. Building Chatbots
chatbot_class		chatbot_class
data		data
envs		envs
images		images
logs		logs
models		models
test		test
.dockerignore		.dockerignore
.env		.env
.flake8		.flake8
.gitignore		.gitignore
Dockerfile		Dockerfile
Introduction to NLP for Chatbot Course.pdf		Introduction to NLP for Chatbot Course.pdf
LICENSE		LICENSE
LangChain One Pager.webp		LangChain One Pager.webp
README.md		README.md
docker-compose.yml		docker-compose.yml
download.sh		download.sh
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Graphlet AI - Chatbot Class

Shout Outs to Ivan Reznikov

Course Essentials

Skill Prerequisites

Course Outline

Course Projects

Course Materials

Code Environment Setup

Install Docker

Docker Compose

Docker and VSCode

Code-Level Environment Setup

Install Anaconda Python

Other Virtual Environments

Install Poetry for Dependency Management

Install Dependencies via Poetry

Essential Tools

`langchain` (docs)

`langchain-hub`

`llama-index`

About

Releases

Packages

Languages

License

Graphlet-AI/chatbot-class

Folders and files

Latest commit

History

Repository files navigation

Graphlet AI - Chatbot Class

Shout Outs to Ivan Reznikov

Course Essentials

Skill Prerequisites

Course Outline

Course Projects

Course Materials

Code Environment Setup

Install Docker

Docker Compose

Docker and VSCode

Code-Level Environment Setup

Install Anaconda Python

Other Virtual Environments

Install Poetry for Dependency Management

Install Dependencies via Poetry

Essential Tools

langchain (docs)

langchain-hub

llama-index

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

`langchain` (docs)

`langchain-hub`

`llama-index`

Packages