chukwa-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Stuti Awasthi (JIRA)" <>
Subject [jira] Created: (CHUKWA-580) Checkpoints increasing without sending data to collector
Date Mon, 17 Jan 2011 15:15:43 GMT
Checkpoints increasing without sending data to collector

                 Key: CHUKWA-580
             Project: Chukwa
          Issue Type: Bug
    Affects Versions: 0.4.0
         Environment: RHEL
            Reporter: Stuti Awasthi

I have a query regarding the checkpoints in chukwa. According to theory :
Every few minutes, each agent process polls a collector to find the length of each file to
which data is being written. The length of the file is then compared with the offset at which
each chunk was to be written. If the file length exceeds this value, then the data has been
committed and the agent process advances its checkpoint accordingly.(Note that the length
returned by the filesystem is the amount of data that has been successfully replicated.)

This means that chukwa_agent_checkpoint would increase only when the agent receivers and ack
from the collectors. But in case of dirtailing adapter, this is not correct. I have done the
following steps to test this :
-         Started agent with some dummy collector which was not present. 
-         Added dirtailing adapter with Charfile tailing adapter
I can see the following output in my checkpoint file :
ADD adaptor_67653208e8dea46c798e46753fc19dad = org.apache.hadoop.chukwa.datacollection.adaptor.filetailer.CharFileTailingAdaptorUTF8
Stuti 0 /root/Stuti/yum.log 0
ADD adaptor_b505db62647203ffa3cfe17374042870 = org.apache.hadoop.chukwa.datacollection.adaptor.DirTailingAdaptor
Stuti /root/Stuti filetailer.CharFileTailingAdaptorUTF8 1295014173306

Since data is not getting sent to collector, so checkpoints should not increase.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message