Have you tried to change: me.prettyprint.cassandra.service.CassandraHostConfigurator#retryDownedHostsDelayInSeconds ?
Hector will ping down hosts every xx seconds and recover connection.
I just got this error ": All host pools marked down. Retry burden pushed out to client." in a few clients recently, client could not recover, we have to restart client application. we are using 0.8.0.3 hector.
At that time we did compaction for a CF, it takes several hours, server was busy. But I think client should recover after server load was down.
Any bug reported about this? I did search but could not find one.