Skip to content

Commit

Permalink
Fix image
Browse files Browse the repository at this point in the history
  • Loading branch information
prrao87 committed Dec 26, 2023
1 parent 64b26f6 commit 1d133e1
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion content/posts/vector-db-3/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -88,7 +88,7 @@ The PQ component can be broken down as per the diagram below. Each vector repres

The second stage is similar to k-means clustering, where a "codebook" of values is learned by minimizing the distance between the original vector and the quantized vector centroids. By mapping a large, high-dimensional vector into smaller, lower-dimensional subvectors, it is only the codebook of quantized values that is stored, making the memory footprint far smaller.

{{ figure(src="vector-db-pq.png" alt="Product quantization: Image inspired by <a href='https://www.pinecone.io/learn/series/faiss/product-quantization/'>Pinecone blog</a>) }}
{{ figure(src="vector-db-pq.png" alt="Product quantization: Image inspired by <a href='https://www.pinecone.io/learn/series/faiss/product-quantization/'>Pinecone blog</a>") }}

IVF is then applied to the PQ vector space -- for each point in the space there is a corresponding region, called a [Voronoi cell](https://en.wikipedia.org/wiki/Voronoi_diagram), consisting of all points in the space closer to the source point (seed) than to any other. These seed points are used to create an inverted index that correlates each centroid with a list of vectors in the space.

Expand Down

0 comments on commit 1d133e1

Please sign in to comment.