Have you tried to change: me.prettyprint.cassandra.service.CassandraHostConfigurator#retryDownedHostsDelayInSeconds ?

Hector will ping down hosts every xx seconds and recover connection.

Regards,
Maciej

On Mon, Mar 5, 2012 at 8:13 PM, Daning Wang <daning@netseer.com> wrote:
I just got this error ": All host pools marked down. Retry burden pushed out to client." in a few clients recently, client could not  recover, we have to restart client application.  we are using 0.8.0.3 hector.

At that time we did compaction  for a CF, it takes several hours, server was busy. But I think client should recover after server load was down.

Any bug reported about this? I did search but could not find one.

Thanks,

Daning