Join us

When failover isn’t safe: Building high-availability PostgreSQL on Kubernetes

When failover isn’t safe: Building high-availability PostgreSQL on Kubernetes

Datadog made PostgreSQL failover safer by treating replica lag as the promotion gate. A zonal-failure gameday showed that detection and automation could not protect the database if the standby sat behind the primary. The team added lag-aware checks, clearer operator signals, and failure drills so engineers could fail over with a known data-loss boundary.


Give this a Pawfive!


Only registered users can post comments. Please, login or signup.

Start writing about what excites you in tech — connect with developers, grow your voice, and get rewarded.

Join other developers and claim your FAUN.dev() account now!

Avatar

Kaptain #Kubernetes

FAUN.dev()

@kaptain
Kubernetes Weekly Newsletter, Kaptain. Curated Kubernetes news, tutorials, tools and more!
Developer Influence
6

Influence

48k

Total Hits

190

Posts