chukwa-user mailing list archives

From Ying Tang <ivytang0...@gmail.com>
Subject Re: chukwa agent doesn't collect the log suddenly , and after several days ,the agent crashes.
Date Tue, 26 Jul 2011 06:07:25 GMT
The log file is a log4j log file, the size is 10M, and the MaxBackupIndex is
1.
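
For context, with size-based rolling in log4j (assuming a RollingFileAppender here), the active file is renamed once it reaches the size limit and a new, empty file takes its place. A tailer that remembers a byte offset can then find the file shorter than that offset. The following is a minimal Python sketch of one way a tailer could detect this, by comparing the current file size with the saved offset. It is not Chukwa's actual implementation; `tail_once` and `demo` are hypothetical names used only for illustration.

```python
import os
import tempfile


def tail_once(path, offset):
    """Read new bytes from `path` starting at `offset`.

    Returns (data, new_offset). If the file is now shorter than `offset`,
    assume it was rotated or truncated and restart from byte 0.
    """
    size = os.path.getsize(path)
    if size < offset:
        # The file shrank, so treat it as a fresh file created by rotation.
        offset = 0
    with open(path, "rb") as f:
        f.seek(offset)
        data = f.read()
    return data, offset + len(data)


def demo():
    """Simulate a rotation: write, tail, swap in a shorter file, tail again."""
    with tempfile.TemporaryDirectory() as d:
        path = os.path.join(d, "app.log")
        with open(path, "wb") as f:
            f.write(b"line one\nline two\n")
        first, offset = tail_once(path, 0)

        # Rotation: the old file is renamed and a new, shorter one appears.
        os.replace(path, path + ".1")
        with open(path, "wb") as f:
            f.write(b"new\n")
        second, offset = tail_once(path, offset)
        return first, second, offset
```

A size check alone misses a rotation when the new file has already grown past the saved offset, so a more careful tailer would also compare a file identity such as the inode number.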



On Tue, Jul 26, 2011 at 1:42 PM, Eric Yang <eric818@gmail.com> wrote:

> Can you run "ls -l" to show the size and date of the log files that you
> are streaming?
>
> regards,
> Eric
>
> On Mon, Jul 25, 2011 at 7:36 PM, Ying Tang <ivytang0812@gmail.com> wrote:
> > The chukwa version is 0.4.0 and the adaptor is
> > org.apache.hadoop.chukwa.datacollection.adaptor.filetailer.CharFileTailingAdaptorUTF8
> >
> > On Mon, Jul 25, 2011 at 11:50 PM, Eric Yang <eric818@gmail.com> wrote:
> >>
> >> Hi Ivy,
> >>
> >> When data is sent from the agent to the collector, the collector sends
> >> an acknowledgment that it received the chunks. At 00:03:28, 5 chunks
> >> were acknowledged. This means communication between the collector and
> >> the agent was working at that point in time. However, there is no
> >> activity after 00:04:28. It looks like the adaptor did not handle the
> >> log rotation properly close to midnight. Which version of Chukwa are
> >> you using, and which adaptor are you using?
> >>
> >> regards,
> >> Eric
> >>
> >> On Jul 25, 2011, at 12:40 AM, Ying Tang wrote:
> >>
> >> > Hi all,
> >> >
> >> > In my cluster, I have two Chukwa agents and one collector.
> >> > At one point, both Chukwa agents logged:
> >> > 2011-07-18 00:03:28,688 INFO Timer-1 HttpConnector - # http chunks
> >> > ACK'ed since last report: 5
> >> > 2011-07-18 00:04:28,697 INFO Timer-1 HttpConnector - # http chunks
> >> > ACK'ed since last report: 0
> >> > 2011-07-18 00:05:28,706 INFO Timer-1 HttpConnector - # http chunks
> >> > ACK'ed since last report: 0
> >> > 2011-07-18 00:06:28,714 INFO Timer-1 HttpConnector - # http chunks
> >> > ACK'ed since last report: 0
> >> > 2011-07-18 00:07:29,340 INFO Timer-1 HttpConnector - # http chunks
> >> > ACK'ed since last report: 0
> >> >
> >> > And the collector logged:
> >> > 2011-07-17 11:02:32,155 INFO Timer-3 SeqFileWriter -
> >> > stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> >> > 2011-07-17 11:02:43,074 INFO Timer-1 root -
> >> > stats:ServletCollector,numberHTTPConnection:0,numberchunks:0
> >> > 2011-07-17 11:03:02,162 INFO Timer-3 SeqFileWriter -
> >> > stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> >> > 2011-07-17 11:03:32,168 INFO Timer-3 SeqFileWriter -
> >> > stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> >> > 2011-07-17 11:03:43,085 INFO Timer-1 root -
> >> > stats:ServletCollector,numberHTTPConnection:0,numberchunks:0
> >> > 2011-07-17 11:04:02,174 INFO Timer-3 SeqFileWriter -
> >> > stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> >> > 2011-07-17 11:04:32,180 INFO Timer-3 SeqFileWriter -
> >> > stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> >> > 2011-07-17 11:04:43,096 INFO Timer-1 root -
> >> > stats:ServletCollector,numberHTTPConnection:0,numberchunks:0
> >> > 2011-07-17 11:05:02,185 INFO Timer-3 SeqFileWriter -
> >> > stat:datacollection.writer.hdfs dataSize=0 dataRate=0
> >> >
> >> > (The collector and the agents are in different time zones.)
> >> > And the collector didn't collect any logs.
> >> >
> >> >
> >> > What does "http chunks ACK'ed since last report: 0" mean?
> >> > From the time "http chunks ACK'ed since last report: 0" first appeared
> >> > until the agents crashed, the Chukwa port was still open, but after
> >> > several days both agents crashed without exceptions.
> >> >
> >> >
> >> > --
> >> > Best regards,
> >> >
> >> > Ivy Tang
> >> >
> >> >
> >> >
> >>
> >
> >
> >
> > --
> > Best regards,
> > Ivy Tang
> >
> >
> >
>



-- 
Best regards,

Ivy Tang
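
The agent log lines quoted in this thread make it possible to pin down when acknowledgments stopped. As a rough sketch, assuming the line format shown in the quoted agent log, the Python below scans log lines and returns the timestamp of the last report with a non-zero ACK count. The names `ACK_RE` and `last_nonzero_ack` are hypothetical, and the pattern may need adjusting for other Chukwa versions or log4j layouts.

```python
import re
from datetime import datetime

# Pattern based on the agent log lines quoted in this thread (an assumption
# about the layout; each report is expected on a single line).
ACK_RE = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}),\d{3} INFO \S+ HttpConnector - "
    r"# http chunks ACK'ed since last report: (\d+)$"
)


def last_nonzero_ack(lines):
    """Return the datetime of the last report with ACK count > 0, or None."""
    last = None
    for line in lines:
        match = ACK_RE.match(line.strip())
        if match and int(match.group(2)) > 0:
            last = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
    return last


# Two of the lines from the quoted agent log, rejoined onto single lines.
SAMPLE = [
    "2011-07-18 00:03:28,688 INFO Timer-1 HttpConnector - "
    "# http chunks ACK'ed since last report: 5",
    "2011-07-18 00:04:28,697 INFO Timer-1 HttpConnector - "
    "# http chunks ACK'ed since last report: 0",
]
```

Calling `last_nonzero_ack(SAMPLE)` gives the time of the 00:03:28 report, which can then be compared with the modification times that "ls -l" shows for the tailed log files.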
