I was asked from a few people about my opinion of the Github's recent service outage. As a creator of MHA, I have lots of MySQL failover experiences. Here are my points about failover design. Most of them duplicate with Robert's points. - "Too Many Connections" is not a reason to start automated failover - Do not repeat failover I know some unsuccessful failover stories that "1. failover happens b