accumulo-notifications mailing list archives

From "Jared Winick (JIRA)" <>
Subject [jira] [Commented] (ACCUMULO-1197) Pass Accumulo trace functionality through the DFSClient
Date Wed, 25 Sep 2013 16:44:04 GMT


Jared Winick commented on ACCUMULO-1197:

In an informal exchange with Todd Lipcon, he said they would like to get Dapper-like tracing
into HDFS for the next major release, as he thinks it is a critical feature for low-latency
workloads. For this all to work, though, I guess everyone's take on "Dapper-like" needs to be
> Pass Accumulo trace functionality through the DFSClient
> -------------------------------------------------------
>                 Key: ACCUMULO-1197
>                 URL:
>             Project: Accumulo
>          Issue Type: Improvement
>          Components: trace
>            Reporter: Josh Elser
>              Labels: gsoc2013, mentor
> The tracing functionality used by Accumulo provides nested regions of timing information
for regular operations occurring inside of Accumulo, e.g. scans and compactions. There is
basic functionality to view this information through the Accumulo monitor. This can be thought
of as a distributed timing infrastructure for Accumulo which uses Accumulo to store its data.
> Currently, this tracing doesn't fall through into HDFS. It would be awesome to actually
introspect through the DFSClient, all the way down to the datanode writing to local disk.
A large portion of the task would be investigating ways to inject the ability for calling
applications (Accumulo, in this case) to provide the trace client through the Hadoop datanode
code and record the necessary timings.
> Skills required would be a good understanding of Java. Some basic knowledge about Apache
Hadoop would also be helpful.
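
To make "nested regions of timing information" concrete, here is a minimal, single-process sketch of the idea. It is not Accumulo's trace API and not anything in Hadoop: the class NestedSpanSketch, its methods, and the span names ("scan", "dfsClient.read", "datanode.writeToDisk") are all hypothetical, chosen only to mirror the layers the issue mentions. A real implementation would presumably have to carry the trace and parent-span identifiers across the RPC boundary into the datanode, which this sketch does not attempt.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Hypothetical illustration only: this is NOT Accumulo's or Hadoop's trace API.
public class NestedSpanSketch {

    /** One timed region; parentId links it to the enclosing region (-1 for a root). */
    static final class Span {
        final long id;
        final long parentId;
        final String description;
        final long startNanos;
        long stopNanos;

        Span(long id, long parentId, String description) {
            this.id = id;
            this.parentId = parentId;
            this.description = description;
            this.startNanos = System.nanoTime();
        }

        long elapsedNanos() {
            return stopNanos - startNanos;
        }
    }

    // Spans that have been started but not yet stopped, innermost first.
    private final Deque<Span> open = new ArrayDeque<>();
    // Spans in the order they were stopped (innermost regions finish first).
    private final List<Span> finished = new ArrayList<>();
    private long nextId = 0;

    /** Opens a new region nested inside whatever region is currently open. */
    Span start(String description) {
        long parentId = open.isEmpty() ? -1 : open.peek().id;
        Span span = new Span(nextId++, parentId, description);
        open.push(span);
        return span;
    }

    /** Closes the innermost open region and records its stop time. */
    void stop() {
        Span span = open.pop();
        span.stopNanos = System.nanoTime();
        finished.add(span);
    }

    List<Span> finished() {
        return finished;
    }

    public static void main(String[] args) {
        NestedSpanSketch trace = new NestedSpanSketch();
        trace.start("scan");                  // region inside the calling application
        trace.start("dfsClient.read");        // region the issue wants to reach
        trace.start("datanode.writeToDisk");  // innermost region on the datanode
        trace.stop();
        trace.stop();
        trace.stop();
        for (Span s : trace.finished()) {
            System.out.println(s.description + " parent=" + s.parentId
                    + " nanos=" + s.elapsedNanos());
        }
    }
}
```

Each span records the id of the region that enclosed it, so the parent/child tree and the per-region timings can be rebuilt afterwards from the flat list of finished spans.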
