hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Trivial Update of "Chukwa_Processes_and_Data_Flow" by BillGraham
Date Tue, 02 Feb 2010 20:36:49 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "Chukwa_Processes_and_Data_Flow" page has been changed by BillGraham.
http://wiki.apache.org/hadoop/Chukwa_Processes_and_Data_Flow?action=diff&rev1=1&rev2=2

--------------------------------------------------

+ <<TableOfContents>>
+ 
+ == Overview ==
  This document describes how Chukwa data is stored in HDFS and the processes that act on
it.
  
- '''HDFS File System Structure'''
+ == HDFS File System Structure ==
  
  The general layout of the Chukwa filesystem is as follows.
  
@@ -19, +22 @@

     temp/
  }}}
  
- '''Raw Log Collection and Aggregation Workflow'''
+ == Raw Log Collection and Aggregation Workflow ==
  
  What data is stored where is best described by stepping through the Chukwa workflow.
  
@@ -52, +55 @@

    * to: {{{archivesProcessing/mrOutput}}}
    * to: {{{finalArchives/[yyyyMMdd]/*/chukwaArchive-part-*}}} 
  
- '''Log Directories Requiring Cleanup'''
+ == Log Directories Requiring Cleanup ==
  
  The following directories will grow over time and will need to be periodically pruned:
  

Mime
View raw message