Senior DevOps Engineer
Himanshu is a Senior DevOps Engineer with hands-on experience in cloud infrastructure, automation, and implementing modern DevOps best practices.
In today’s cloud-native world, database availability is absolutely critical. Amazon Aurora, being a high-performance and fully managed relational database, is designed with fault tolerance in mind. But like with most things in tech, the way you configure it can make a huge difference when things go sideways.
In this post, I’ll walk you through a real incident from a production environment that we recently handled. It’s a good example of how things can still go wrong despite having a seemingly resilient setup—and what you can do to avoid similar issues.
One of our customers was running an Aurora cluster with the standard setup—one Writer and one Reader instance. To manage database connections efficiently, they were using RDS Proxy with a read-only endpoint.
On paper, this sounds pretty solid:
However, the Reader instance experienced a host-level failure. At this point, RDS Proxy, which was wired to the read-only endpoint, just waited for the Reader to come back.
What didn’t happen?
When we looked deeper, here’s what we found:
Now, this might feel counterintuitive, but according to the AWS documentation, this is expected behavior.
If you’re using Aurora with RDS Proxy, there are a few things you should definitely consider to avoid this kind of scenario.
Option 1: Improve RDS Proxy Configuration
1. Always Have More Than One Reader
2. Set Failover Priorities Wisely
Option 2: Remove RDS Proxy and Use Cluster Endpoints
Instead of relying on RDS Proxy, use Aurora cluster endpoints directly.
Wrapping Up
At the end of the day, Aurora gives you a solid foundation, but it’s up to us to build a setup that’s actually resilient.
Just adding one more Reader and adjusting failover priorities could have prevented the downtime in this case. These might sound like small tweaks, but in production, they can make the difference between smooth failover and frustrated users.
Also, remember:
Speak with our advisors to learn how you can take control of your Cloud Cost