chukwa-dev mailing list archives

From "Bill Graham (JIRA)" <j...@apache.org>
Subject [jira] Created: (CHUKWA-534) Improve fault-tolerance of DemuxManager.
Date Tue, 12 Oct 2010 18:33:35 GMT
Improve fault-tolerance of DemuxManager.
----------------------------------------

                 Key: CHUKWA-534
                 URL: https://issues.apache.org/jira/browse/CHUKWA-534
             Project: Chukwa
          Issue Type: Improvement
            Reporter: Bill Graham
            Assignee: Bill Graham


If the DemuxManager receives more than 5 consecutive errors, it dies with the message "Too
many errors, Bail out!".

Let's change this to introduce a configurable number of consecutive exceptions that may be encountered
before dying. If the value is set to -1, the expected behavior is to keep retrying ad infinitum.

Also part of this issue is to improve logging of how many consecutive errors have occurred,
as well as the time they started. A possible future enhancement could be to support an error
time threshold in addition to an absolute count.

Suggesting the following new config setting. It's a bit verbose, but it's clear.

{noformat}
chukwa.demux.max.error.count.before.shutdown
{noformat}
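
The proposed behavior could look roughly like the following. This is only a minimal sketch, not the actual DemuxManager code: the class name, the helper methods, the use of java.util.Properties as a stand-in for the real configuration object, and the maxIterations bound (there only so the example terminates) are all assumptions made for illustration. It also assumes the process dies once the consecutive error count exceeds the configured value, mirroring the current "more than 5" behavior.

```java
import java.util.Date;
import java.util.Properties;
import java.util.concurrent.Callable;

// Hypothetical sketch of a configurable consecutive-error threshold.
class DemuxErrorThresholdSketch {
    // Key proposed in this issue; the default of 5 mirrors the current hard-coded limit.
    static final String MAX_ERRORS_KEY = "chukwa.demux.max.error.count.before.shutdown";
    static final int DEFAULT_MAX_ERRORS = 5;

    // Reads the threshold from a Properties object (a stand-in for the real config).
    static int readMaxErrors(Properties conf) {
        return Integer.parseInt(
            conf.getProperty(MAX_ERRORS_KEY, String.valueOf(DEFAULT_MAX_ERRORS)));
    }

    // A value of -1 means "never shut down, keep retrying".
    static boolean shouldShutdown(int consecutiveErrors, int maxErrors) {
        return maxErrors != -1 && consecutiveErrors > maxErrors;
    }

    // Runs the step repeatedly and returns the consecutive error count
    // at the point the loop stopped.
    static int runLoop(Callable<Boolean> step, int maxErrors, int maxIterations) {
        int consecutiveErrors = 0;
        long firstErrorTime = 0L;
        for (int i = 0; i < maxIterations; i++) {
            try {
                step.call();
                consecutiveErrors = 0; // any success resets the streak
            } catch (Exception e) {
                if (consecutiveErrors == 0) {
                    firstErrorTime = System.currentTimeMillis();
                }
                consecutiveErrors++;
                // Log both the streak length and when the streak started.
                System.err.println("Consecutive error " + consecutiveErrors
                    + " (streak started at " + new Date(firstErrorTime) + "): " + e);
                if (shouldShutdown(consecutiveErrors, maxErrors)) {
                    System.err.println("Too many errors, Bail out!");
                    return consecutiveErrors;
                }
            }
        }
        return consecutiveErrors;
    }
}
```

With the default of 5, a step that always fails stops the loop on the sixth consecutive error. With -1, the loop only ends when the illustrative iteration bound is reached.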


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

