hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Colin Patrick McCabe (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-12949) Add HTrace to the s3a connector
Date Mon, 21 Mar 2016 17:57:25 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-12949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15204750#comment-15204750

Colin Patrick McCabe commented on HADOOP-12949:

Hi [~madhawa], great idea!  I think the first thing to do is to read a bit about how to set
up HTrace.  See http://blog.cloudera.com/blog/2015/12/new-in-cloudera-labs-apache-htrace-incubating/
If you can get a working setup for HTrace-on-HDFS, it will help for adding tracing to other
projects such as the s3a connector.

> Add HTrace to the s3a connector
> -------------------------------
>                 Key: HADOOP-12949
>                 URL: https://issues.apache.org/jira/browse/HADOOP-12949
>             Project: Hadoop Common
>          Issue Type: Improvement
>            Reporter: Madhawa Gunasekara
> Hi All, 
> s3, GCS, WASB, and other cloud blob stores are becoming increasingly important in Hadoop.
But we don't have distributed tracing for these yet. It would be interesting to add distributed
tracing here. It would enable collecting really interesting data like probability distributions
of PUT and GET requests to s3 and their impact on MR jobs, etc.
> I would like to implement this feature, Please shed some light on this 
> Thanks,
> Madhawa

This message was sent by Atlassian JIRA

View raw message