Scaling Semantic Search in PostgreSQL : Indexing and Searching at Scale with pgvector
Presented by:
Gunjan Juyal
No video of the event yet, sorry!
Summary
Vector databases are revolutionizing information-retrieval applications by bridging the semantic-information unlocked by LLMs to the power and convenience of semantic query search. But efficiently managing and querying vector embeddings within a relational database like PostgreSQL requires specialized knowledge, especially when talking about the scale and accuracy expected in a production scenario. This talk demystifies the process of scaling pgvector for large workloads, focusing on practical techniques but with brief dives into the algorithms powering this awesome extension.
What We Will Cover
- Index-creation scaling techniques: Quantization (with re-ordering), filtered indexes for low-cardinality dimensions, parallel index building
- Tuning index-type specific indexing parameters for recall (aka accuracy) vs TPS tradeoff: IVFFlat (lists) and HNSW (ef_construction)
- Tuning index-type specific querying parameters for recall (aka accuracy) vs TPS tradeoff: IVFFlat (probes) and HNSW (ef_search)
- Query scaling techniques: Distributed queries using foreign-data-wrapper based sharding.
- Handling data-drift: Mitigate performance drift caused by evolving datasets in IVFFlat and HNSW index types.
- Pgvector weak spots: Open areas where pgvector currently struggles, such as specific query shapes (e.g. high selectivity), data size limitations, and vector-size limitations.
- A brief overview of some techniques to mitigate these limitations to some extent: Quantization to support higher-dimensional vectors or larger datasets, high-selectivity queries etc
Key Takeaways
This intermediate-level session empowers GenAI application developers and DBAs with the tools and strategies needed to scale pgvector beyond prototype stage to unlock its full potential for high-performance vector search.
Target Audience
This talk is aimed at GenAI application developers, DBAs and those interested in vector-related algorithms who already have a basic understanding of concepts such as vectors and ANN search, and who are interested in digging deeper into pgvector to understand its feasibility for their large-scale datasets and workloads.
Presenters
- Gunjal Juyal is a Software Engineer at the Cloud SQL for PostgreSQL team in Google Cloud and is based out of Bangalore. (LinkedIn Profile)
- Eeshan Gupta is a Software Engineer at the Cloud SQL for PostgreSQL team in Google Cloud and is based out of Bangalore. (LinkedIn Profile)
- Date:
- Duration:
- 45 min
- Room:
- Conference:
- PGConf India, 2025
- Language:
- Track:
- Application Developer
- Difficulty:
- Hard