incubator-chukwa-user mailing list archives

From Eric Yang <eric...@gmail.com>
Subject Re: chukwa agent doesn't collect the log suddenly , and after several days ,the agent crashes.
Date Mon, 25 Jul 2011 15:50:52 GMT
Hi Ivy,

When data is sent from the agent to the collector, the collector acknowledges the chunks
it receives.  At 00:03:28, 5 chunks were acknowledged, which means communication between
the collector and the agent was working at that point.  However, from the 00:04:28 report
onward, no chunks were acknowledged.  It looks like the adaptor did not handle the log
rotation properly close to midnight.  Which version of Chukwa are you using, and which
adaptor?
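
To pin down when the acknowledgments stopped, the agent log can be scanned for the HttpConnector report lines. The following is only a minimal sketch, assuming the log format matches the lines quoted in this thread; the function names `parse_ack_reports` and `find_stall` are hypothetical helpers and not part of Chukwa.

```python
import re
from datetime import datetime

# Assumed format, taken from the agent log lines quoted in this thread.
ACK_LINE = re.compile(
    r"(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}),\d{3} INFO \S+ HttpConnector - "
    r"# http chunks ACK'ed since last report: (\d+)"
)


def parse_ack_reports(lines):
    """Return (timestamp, acked_count) for every HttpConnector report line."""
    reports = []
    for line in lines:
        match = ACK_LINE.search(line)
        if match:
            ts = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
            reports.append((ts, int(match.group(2))))
    return reports


def find_stall(reports):
    """Return (last report with ACKs > 0, first zero report after it).

    Either element is None when no such report exists.
    """
    last_active = None
    first_zero = None
    for ts, count in reports:
        if count > 0:
            # Activity resumed, so any earlier run of zeros was not the final stall.
            last_active, first_zero = ts, None
        elif first_zero is None:
            first_zero = ts
    return last_active, first_zero


SAMPLE = """\
2011-07-18 00:03:28,688 INFO Timer-1 HttpConnector - # http chunks ACK'ed since last report: 5
2011-07-18 00:04:28,697 INFO Timer-1 HttpConnector - # http chunks ACK'ed since last report: 0
2011-07-18 00:05:28,706 INFO Timer-1 HttpConnector - # http chunks ACK'ed since last report: 0
"""

if __name__ == "__main__":
    last_active, first_zero = find_stall(parse_ack_reports(SAMPLE.splitlines()))
    print("last report with ACKs:", last_active)
    print("first report with zero ACKs:", first_zero)
```

Comparing the two timestamps with the time the monitored log file was rotated would help confirm or rule out the log-rotation theory.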

regards,
Eric

On Jul 25, 2011, at 12:40 AM, Ying Tang wrote:

> Hi all,
>  
> In my cluster, I have two Chukwa agents and one collector.
> At one point, both Chukwa agents logged:
> 2011-07-18 00:03:28,688 INFO Timer-1 HttpConnector - # http chunks ACK'ed since last report: 5
> 2011-07-18 00:04:28,697 INFO Timer-1 HttpConnector - # http chunks ACK'ed since last report: 0
> 2011-07-18 00:05:28,706 INFO Timer-1 HttpConnector - # http chunks ACK'ed since last report: 0
> 2011-07-18 00:06:28,714 INFO Timer-1 HttpConnector - # http chunks ACK'ed since last report: 0
> 2011-07-18 00:07:29,340 INFO Timer-1 HttpConnector - # http chunks ACK'ed since last report: 0
>  
> And the collector
> 2011-07-17 11:02:32,155 INFO Timer-3 SeqFileWriter - stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> 2011-07-17 11:02:43,074 INFO Timer-1 root - stats:ServletCollector,numberHTTPConnection:0,numberchunks:0
> 2011-07-17 11:03:02,162 INFO Timer-3 SeqFileWriter - stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> 2011-07-17 11:03:32,168 INFO Timer-3 SeqFileWriter - stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> 2011-07-17 11:03:43,085 INFO Timer-1 root - stats:ServletCollector,numberHTTPConnection:0,numberchunks:0
> 2011-07-17 11:04:02,174 INFO Timer-3 SeqFileWriter - stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> 2011-07-17 11:04:32,180 INFO Timer-3 SeqFileWriter - stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> 2011-07-17 11:04:43,096 INFO Timer-1 root - stats:ServletCollector,numberHTTPConnection:0,numberchunks:0
> 2011-07-17 11:05:02,185 INFO Timer-3 SeqFileWriter - stat:datacollection.writer.hdfs dataSize=0 dataRate=0
>  
> (The collector and the agents are in different time zones.)
> The collector didn't collect any logs.
>  
>  
> What does "http chunks ACK'ed since last report: 0" mean?
> From the time "http chunks ACK'ed since last report: 0" first appeared until the agents
> crashed, the Chukwa port was still open, but after several days both agents crashed
> without any exceptions.
>  
>  
> -- 
> Best regards,
> 
> Ivy Tang
> 
> 
> 

