incubator-chukwa-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ying Tang <ivytang0...@gmail.com>
Subject Re: chukwa agent doesn't collect the log suddenly , and after several days ,the agent crashes.
Date Tue, 26 Jul 2011 02:36:51 GMT
The chukwa version is 0.4.0 and the adaptor is
org.apache.hadoop.chukwa.datacollection.adaptor.filetailer.CharFileTailingAdaptorUTF8

On Mon, Jul 25, 2011 at 11:50 PM, Eric Yang <eric818@gmail.com> wrote:

> Hi Ivy,
>
> When data is send from agent to collector, collector send acknowledgment of
> receiving of the chunks.  At 00:03:28, there are 5 chunks acknowledged.
>  This means communication between collector and agent are working at that
> point in time.  However, there is no activity after 00:04:28.  This looks
> like adaptor did not handle the log rotation properly at close to midnight.
>  Which version of Chukwa are you using and which adaptor are you using?
>
> regards,
> Eric
>
> On Jul 25, 2011, at 12:40 AM, Ying Tang wrote:
>
> > Hi all,
> >
> > In my cluster , i have two chukwa agent and one collector .
> > At a time ,  both chukwa agents's log :
> > 2011-07-18 00:03:28,688 INFO Timer-1 HttpConnector - # http chunks ACK'ed
> since last report: 5
> > 2011-07-18 00:04:28,697 INFO Timer-1 HttpConnector - # http chunks ACK'ed
> since last report: 0
> > 2011-07-18 00:05:28,706 INFO Timer-1 HttpConnector - # http chunks ACK'ed
> since last report: 0
> > 2011-07-18 00:06:28,714 INFO Timer-1 HttpConnector - # http chunks ACK'ed
> since last report: 0
> > 2011-07-18 00:07:29,340 INFO Timer-1 HttpConnector - # http chunks ACK'ed
> since last report: 0
> >
> > And the collector
> > 2011-07-17 11:02:32,155 INFO Timer-3 SeqFileWriter -
> stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> > 2011-07-17 11:02:43,074 INFO Timer-1 root -
> stats:ServletCollector,numberHTTPConnection:0,numberchunks:0
> > 2011-07-17 11:03:02,162 INFO Timer-3 SeqFileWriter -
> stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> > 2011-07-17 11:03:32,168 INFO Timer-3 SeqFileWriter -
> stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> > 2011-07-17 11:03:43,085 INFO Timer-1 root -
> stats:ServletCollector,numberHTTPConnection:0,numberchunks:0
> > 2011-07-17 11:04:02,174 INFO Timer-3 SeqFileWriter -
> stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> > 2011-07-17 11:04:32,180 INFO Timer-3 SeqFileWriter -
> stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> > 2011-07-17 11:04:43,096 INFO Timer-1 root -
> stats:ServletCollector,numberHTTPConnection:0,numberchunks:0
> > 2011-07-17 11:05:02,185 INFO Timer-3 SeqFileWriter -
> stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> >
> > (the collector and agent has  different  timezone)
> > And the collector didn't collect any log.
> >
> >
> > What dons the "http chunks ACK'ed since last report: 0" means?
> > And from this log "http chunks ACK'ed since last report: 0" appears to
>  agent crash, the chukwa port still on , but after several days, both agents
> crashed without exceptions.
> >
> >
> > --
> > Best regards,
> >
> > Ivy Tang
> >
> >
> >
>
>


-- 
Best regards,

Ivy Tang

Mime
View raw message