Program for PGConf India, 2026
Experimenting with a Global Index in PostgreSQL: Design, Implementation, and Challenges
presented by Dilip Kumar from Google
The main limitation is that unique keys must include the partition key, which can be restrictive for many use cases. A global index addresses this issue by enabling unique indexes that span the entire partitioned hierarchy, removing the need for partition key inclusion in primary keys.
In this talk, we share our experience experimenting with the implementation of global indexes in PostgreSQL...
moreFair Coffee, Fair Data: Practical Bias Detection in Coffee Supply Chains with PostgreSQL
presented by Koji Annoura from Independent
PostgreSQL is evolving beyond its traditional relational roots by adopting property graph query capabilities through SQL/PGQ. This extension enables graph-style pattern matching and traversal directly within PostgreSQL, removing the need for separate graph database systems. For database practitioners, this opens the door to analyzing complex relationships while continuing to rely on PostgreSQL’...
more Application DeveloperCase Study: Understanding the importance of Commitlog through a database corruption
presented by Amit Singh from EnterpriseDB
Database corruption occurs when the data stored in a database becomes inconsistent, unreadable, or lost due to various factors. I will cover some database internals at a high level which are needed to understand the cause corruption and the potential solutions to address a specific corruption scenario. In this talk I will dissect a database corruption incident and the steps performed to re...
more Case StudyHow we made WarehousePG Open Source (again)
presented by Andreas Scherbaum from PostgreSQL Usergroup Europe
WarehousePG is an Open Source fork of Greenplum Database, which by itself is a fork of PostgreSQL. The project was born after the upstream project was made closed source.
This talk gives an overview of the history of both projects, which already spans more than two decades. We then dive into the reasons for creating a fork, and all the stumbling stones we had to pass in order to make this pr...
more Application DeveloperSQL Injection Is Boring—Advanced Threats You’re Not Watching
presented by Kranthi Kiran Burada from Amazon Web Services
Everyone knows how to prevent basic SQL injection—but modern attackers have moved far beyond textbook exploits. In high-traffic PostgreSQL deployments, subtle misconfigurations and overlooked features can open doors to far more sophisticated attacks. This talk uncovers the next generation of database threats that rarely make it into security checklists. We’ll examine: * Privilege Escalation v...
more Database AdministrationAWSM FSM!
presented by Nikhil Sontakke from Microsoft
PostgreSQL’s Free Space Map (FSM) is a critical internal structure that helps the database efficiently manage and reuse space within tables, directly impacting performance and storage utilization. In this talk, we’ll demystify how FSM works, explore its role in common operations like inserts, updates, and VACUUM, and discuss practical strategies for monitoring and tuning FSM for real-world work...
more Database Engine DevelopersHow Postgres can leverage disk bandwidth for better TPS.
presented by Nikhil Chawla from Microsoft
This talk demonstrates how running Postgres on log-structured (LSM) file systems, for example F2FS, can significantly improve throughput in write-heavy workloads.
I'll show how I identified surprising bottlenecks in high throughput Postgres systems, where the main issue turned out to be the number of IOPS, due to small random writes. The system could not utilize the full storage bandwid...
more Application DeveloperOperational hazards of managing PostgreSQL DBs over 100TB
presented by Teresa Lopes from Adyen
How do you backup (and restore) a +100TB database? Well, maybe you don't.
In this talk I will share the singularities I encountered when managing huge PostgreSQL databases, like backups, high availability challenges, how to keep vacuum under control...
When reading blog articles, the best practices, the "how to" guides, things seem straightforward, but when you start bending PostgreSQL li...
more Case StudyPG18 Hacktober: 31 Days of New Features
presented by Hari Kiran from OpenSource DB
While September 2025 last week saw the PG18 release, we dedicated October on a focused initiative: "PG18 Hacktober: 31 Days of New Features" to explore and highlight 31 new features/changes in the new PostgreSQL 18 version. This talk presents the outcomes of that project, highlighting both practical enhancements and experimental developments that demonstrate PostgreSQL 18’s features.
The ses...
more Database AdministrationDeveloping PostgreSQL Extensions in C: Hooks, Shared Memory & Best Practices
presented by Suresh Dash from Fujitsu
PostgreSQL’s extensibility empowers developers to build powerful, custom functionality directly within the database. Writing extensions in C allows maximum performance and integration, but requires a deep understanding of PostgreSQL’s internal architecture, memory management, and hook mechanisms.
This session provides a comprehensive, hands-on exploration of PostgreSQL extension development,...
more Database Engine DevelopersExternal Proxies and Poolers - A reality check in todays tech stack
presented by Jobin Augustine from Percona
For decades, the standard PostgreSQL playbook has included an external connection pooler like PgBouncer or PgPool-II and proxies like HAProxy for service discovery, which we won’t see generally with other database installations like Oracle or SQLServer. This architectural pattern, mainly born from the high cost of process forking in older operating systems, is often treated as gospel. But what ...
more Database AdministrationDocument Workloads: PostgreSQL JSONB and MongoDB
presented by Franck Pachot from MongoDB
PostgreSQL was among the first relational databases to introduce a JSON data type, emerging at roughly the same time MongoDB was gaining traction as a NoSQL document database. While both PostgreSQL's JSONB and MongoDB's BSON formats store JSON, they differ significantly in their implementations and goals: PostgreSQL adheres to SQL, stores data as rows within blocks, and relies on extensions to ...
more Database AdministrationPostgres at Scale: Lessons from Running Multi-Terabyte Clusters
presented by Roneel Kumar, Sameer Kumar from AWS
PostgreSQL has matured into one of the most trusted databases for mission-critical workloads, but scaling it to tens or even hundreds of terabytes is a different game altogether. At this scale, the challenges shift from simple tuning to deep architectural decisions: vacuum pressure, index bloat, replication lag, and the limits of storage and I/O subsystems all start to matter.
In this talk...
more Case StudyWhat could possibly go wrong if you try to load 1 Petabyte into PostgreSQL
presented by Michael Zhilin from Postgres Professional
At the end of year 2024, we decided to run a small, quick test of the PostgreSQL-based distributed database called Shardman on 1 petabyte of data. The plan was to rent the hardware, run one of popular benchmark, and prepare a report with the results. But from the very beginning, nothing went according to plan.
In this talk, we will share:
- The real story of a single benchmark test
- Our...
From Disk to Data: Exploring PostgreSQL's Buffer Management
presented by Lalit Choudhary from PurnaBIT
Get a high-level Overview of why the Buffer Manager is critical for performance, acting as the gateway between disk and memory.
Explore the core Buffer Manager Structure, including the shared buffer pool and the key data structures that manage it.
Understand the role of Buffer Manager Locks in maintaining consistency in a highly concurrent multi-process architecture.
Walk th...
The End of ANALYZE Hell: PostgreSQL 18's Statistics Preservation Story
presented by Rajkumar Raghuwanshi, Nishad Mankar from AWS
PostgreSQL 18 introduces a game-changing feature that significantly improves the database upgrade experience by preserving optimizer statistics during major version upgrades. This talk will dive deep into the technical implementation and benefits of statistics retention, focusing on how it eliminates the need for lengthy post-upgrade ANALYZE operations that were previously required. We'll expl...
more Database AdministrationFinding Hidden Memory Bugs in PostgreSQL with Custom Memory Allocators
presented by Samuel Cherukutty from Zoho Corp
Talk Proposal: Finding Hidden Memory Bugs in PostgreSQL with Custom Memory Allocators
Track: Database Engine Developers
Level: Intermediate to Advanced
Duration: 45 minutes (30 min presentation + 15 min Q&A)
Abstract
Memory safety bugs in PostgreSQL are notoriously difficult to detect. While tools like Valgrind and AddressSanitizer (ASan) help, PostgreSQL's A...
more Database Engine DevelopersGuarding PII in PostgreSQL: Leveraging postgresql-anonymizer for Secure & Compliant Data in All Environments
presented by Boobathi Parameswaran from AWS
Core Use Case: Comprehensive Data Privacy Management: From Secure Test Data Generation to Dynamic Access Control in Production
Description: This session provides a deep dive into postgresql-anonymizer, showcasing its role as a powerful, production-ready solution for robust PII/PHI protection across the entire data lifecycle. Leveraging features from its 2.0 release—including flexib...
moreBuilding OAuth2 Validator Extension for PostgreSQL 18
presented by Ajit Awekar from EnterpriseDB
PostgreSQL 18 introduces native support for OAuth 2.0 authentication via the SASL OAUTHBEARER mechanism, elevating its position as a modern resource server. This new feature relies on a pluggable validator module for external token verification, a critical extension point for integrating with diverse Identity Providers (IdPs) like Okta, Keycloak, or custom Single Sign On solutions.
This talk...
more Database Engine DevelopersUnderstanding Locking and Concurrency in PostgreSQL
presented by Pavan Deolasee from EnterpriseDB
This session takes a deep dive into how PostgreSQL manages concurrent access through its sophisticated locking and concurrency control mechanisms. We’ll start by understanding why locks exist in the first place, how they ensure data consistency when multiple clients read and modify the same data, and then explore the main types of locks at both the row and table level. You’ll learn how the lock...
more Database Engine DevelopersA Deep Dive into SSI: Serializable Snapshot Isolation in PostgreSQL
presented by Vaijayanti Bharadwaj from EnterpriseDB
Introduction
Till 2011, Serializable isolation level in PostgreSQL was in fact Snapshot Isolation(SI). SI can allow some types of write skew anomalies with workloads which read and write data related by constraints concurrently. Traditional methods to implement Serializable isolation level use strict 2-phase locking (S2PL), which is a pessimistic algorithm and involves taking locks on re...
more Database Engine DevelopersBeyond shared_buffers: On-Demand Memory in Modern PostgreSQL
presented by Vaibhav Popat from Google
In this talk, we will cover about Dynamic Shared Memory Segments - an alternative to the main shared memory , where memory can be requested even after PG startup.
Problem -
PG uses shared memory for inter-process communication among processes that need to access and share data with each other. But, the postmaster allows requesting any bytes in the main shared memory, only during...
more Database Engine DevelopersHard-won lessons after processing 6.7T events through Postgre queues
presented by Rohith BCS from RudderStack
Abstract
While many organizations reach for specialized streaming systems like Apache Kafka for high-throughput event processing, RudderStack chose a different path: PostgreSQL. This talk chronicles six years of battle-tested lessons learned while scaling PostgreSQL from a simple queue to a system processing 100,000 events per second, and delivering total 6.7T events. I'll share specific...
more Case StudyConflict Detection and Resolution: Strengthening PostgreSQL for Distributed Workloads
presented by Nisha Moond from Fujitsu
Logical replication continues to evolve as a key mechanism for data distribution in PostgreSQL. However, when multiple nodes modify the same data, inconsistencies/conflicts can arise. PostgreSQL 18 takes a major step forward by introducing built-in conflict detection, laying the groundwork for automated conflict management in future releases.
This talk will cover:
- How and why conflicts ...
Journey of developing a Performance Optimization Feature in PostgreSQL
presented by Rahila Syed from Microsoft
In this talk, I will share the journey of identifying and optimizing a performance bottleneck in PostgreSQL. The session will walk through a systematic approach to diagnosing performance issues — distinguishing whether the bottleneck lies in the CPU, I/O, or network — and how iterative profiling and analysis can guide effective optimizations. Using perf and other diagnostic tools, we’ll examin...
more Database AdministrationWhere is My Memory? (Postgres Edition)
presented by Pushkar Khadilkar, Mohit Agarwal from Google
Where is My Memory? (Postgres Edition)
Speakers: Mohit Agarwal, Pushkar Khadilkar
This talk delves into the intricate world of PostgreSQL memory management, exploring the various components that contribute to its memory footprint and offering insights into advanced optimization techniques. The session will empower developers and DBAs to gain a comprehensive understanding of where Po...
more Database Engine DevelopersInside PostgreSQL High Availability: Quorum, Split-Brain, and Failover at Scale
presented by Venkat Akhil from OpenSource DB
Abstract
PostgreSQL High Availability (HA) isn’t just about replication,it’s about making distributed systems converge under failure. This session goes deep into the internals of PostgreSQL’s HA stack, focusing on how Patroni, etcd/Consul, and HAProxy collaborate to maintain consensus and avoid split-brain scenarios. We’ll dissect the timing and sequence of failover events, WAL divergence ...
more Database AdministrationThe Life of a Tuple in Logical Replication
presented by Shlok Kyal from Fujitsu
Logical replication is one of the most intricate yet fascinating parts of PostgreSQL. In this session, we’ll take a closer look at how a single tuple travels through the logical replication pipeline, from the moment it’s changed on the publisher to when it’s finally applied on the subscriber.
We’ll begin with how changes are captured in the WAL, then see how the walsender and reorder buffer ...
more Database Engine DevelopersKey PostgreSQL Query Tuning foundation Every Database Developer Should Build
presented by Deepak Mahto from DataCloudGaze Consulting
Tuning SQL in PostgreSQL isn’t about chasing magic settings or quick fixes it’s about understanding how the database thinks. In this session, I’ll share a structured approach to finding and fixing slow queries by focusing on fundamentals: identifying problematic SQL, reading execution plans with intent, and connecting what you see in the plan to what the optimizer actually does.
We’ll wa...
more Database AdministrationOperating Postgres Logical Replication at Massive Scale
presented by Sai Srirampur from Clickhouse
ClickHouse is a large-scale user of Postgres logical replication for Change Data Capture, replicating over 200 TB of data per month from Postgres across 300+ customers. Some individual customer deployments exceed 60 TB, moving tens of terabytes monthly.
In this talk, we’ll share our journey of scaling Postgres logical decoding to this level—covering the Postgres-specific optimizations, chall...
more Case StudyMastering CloudNativePG: Install Scale & Recover
Mastering CloudNativePG: Install Scale & Recover
presented by Danish Khan, Neel Patel from EnterpriseDB
Mastering CloudNativePG: Install, Scale, and Recover
Running stateful databases on Kubernetes used to be hard, but the CloudNativePG (CNPG) operator makes it production-ready. In this hands-on workshop, you will learn the full lifecycle of managing PostgreSQL in a cloud-native environment. We will move beyond the basics to cover the three critical pillars of database operations: reliable dep...
more TrainingPostgreSQL Tools & Features to Maximize Developer and DBA Productivity & PostgreSQL 18 New Features Enhancements
PostgreSQL Tools & Features to Maximize Developer and DBA Productivity & PostgreSQL 18 New Features Enhancements
presented by Vinay Kumar Paladi, Sashikanta Pattanayak from Amazon Web Services
This training focuses on helping developers and DBAs leverage the powerful tools and features within PostgreSQL to enhance productivity, improve database performance, and simplify administration. The session will cover a wide range of essential PostgreSQL capabilities—from core functionalities to advanced optimizations—and explore the exciting new features introduced in PostgreSQL 18. Through ...
more TrainingA Disastrous Day in the Life of a PostgreSQL DBA
A Disastrous Day in the Life of a PostgreSQL DBA
presented by Kanthanathan S, Sathakkathullah from IITM Pravartak
This 3-hour workshop simulates a high-pressure day of PostgreSQL outages—application slowdowns, storage spikes, WAL explosions, corruption alerts, and accidental data loss. Participants practice real-world troubleshooting using PostgreSQL logs, pg_stat_activity, lock diagnostics, autovacuum and bloat handling, WAL and checkpoint analysis, live query tuning, and backup/PITR-based recovery. The...
Build Agentic AI with Semantic Kernel and Graph RAG on PostgreSQL
Build Agentic AI with Semantic Kernel and Graph RAG on PostgreSQL
presented by Shriram Muthukrishnan from Microsoft
In this hands-on lab, you will build an agent-driven Retrieval-Augmented Generation (RAG) application using a U.S. Case Law dataset. You will work with Azure Database for PostgreSQL, Visual Studio Code, and the Semantic Kernel Agent Framework to explore AI-powered text and vector search. The lab also introduces GraphRAG with Apache AGE, enabling enriched querying across structured and graph-bas...
more TrainingPostgreSQL Performance Clinic: Live Diagnosis and Tuning
PostgreSQL Performance Clinic: Live Diagnosis and Tuning
presented by Neeta Goel, Hari Prasad from Newt Global
Trainers will discuss key performance considerations for complex Oracle workloads migrated to PostgreSQL. The session will feature real-world examples in an interactive format.
Topics Covered
1.Using database metrics for debugging and performance tuning
2.Identifying lock contention, bloat, deadlocks, and vacuum-related issues
3.Analyzing and tuning slow queries with EXPLAIN (ANALYZ...
more TrainingSupercharge Your PostgreSQL: A Hands-On Guide to High Availability and Connection Pooling with ProxySQL
Supercharge Your PostgreSQL: A Hands-On Guide to High Availability and Connection Pooling with ProxySQL
presented by René Cannaò, Wazir Ahmed from ProxySQL LLC
As PostgreSQL deployments scale to handle "always-on" applications, managing connection overhead, ensuring high availability, and optimizing query traffic become critical challenges. ProxySQL, a high-performance open-source database proxy, is designed to solve these problems at scale.
This 3-hour hands-on workshop guides attendees through the architecture, configuration, and practical applic...
more TrainingBuilding the Open Source Data Stack with Postgres and ClickHouse
Building the Open Source Data Stack with Postgres and ClickHouse
presented by Kevin Biju, Amogh Bharadwaj from Clickhouse
An increasing number of organizations are adopting a proven architectural pattern: using PostgreSQL for transactional workloads and ClickHouse for analytics. Instead of relying on a single database to do everything, this approach combines two mature open-source systems — each optimized for its specific purpose. It’s the foundation behind real-time data platforms at companies like GitLab, Cloudf...
more TrainingAgentic AI Applications with Google Cloud
Agentic AI Applications with Google Cloud
presented by Abhijeet Rajurkar, Saurabh Gupta from Google
Join us for an immersive training session that will equip you with the essential skills OF AI with operation databases. This training will be focused on building Agentic AI Applications with Google Cloud PostgresSQL variants. Use Gemini hosted on vertex AI and integrate it with custom agents developed by you . Configure these agents to connect to postgres and use NL2SQL to query the database....
more Training