hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Li Lu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-5814) Add druid as storage backend in YARN Timeline Service
Date Mon, 14 Nov 2016 23:38:58 GMT

    [ https://issues.apache.org/jira/browse/YARN-5814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15665393#comment-15665393
] 

Li Lu commented on YARN-5814:
-----------------------------

Thanks [~BINGXUE QIU] for the doc! I have some quick questions:
1. According to the Design section, the writer may require tranquility and/or kafka as intermediate
layers. I'm wondering if there are any issues with these dependencies? 
2. For the table design, right now in timeline v.2, container is not a top-level concept (although
it is a top-level concept for YARN). Therefore I'm not sure if it is helpful to generalize
the container table to an entity table, just as the HBase implementation? We may still put
container level data into this table, but maybe it's possible to not to limit this table to
container only? 

>  Add druid as storage backend in YARN Timeline Service
> ------------------------------------------------------
>
>                 Key: YARN-5814
>                 URL: https://issues.apache.org/jira/browse/YARN-5814
>             Project: Hadoop YARN
>          Issue Type: New Feature
>          Components: ATSv2
>    Affects Versions: 3.0.0-alpha2
>            Reporter: Bingxue Qiu
>         Attachments: Add-Druid-in-YARN-Timeline-Service.pdf
>
>
> h3. Introduction
> I propose to add druid as storage backend in YARN Timeline Service.
> We run more than 6000 applications and generate 450 million metrics daily in Alibaba
Clusters with thousands of nodes. We need to collect and store meta/events/metrics data, online
analyze the utilization reports of various dimensions and display the trends of allocation/usage
resources for cluster by joining and aggregating data. It helps us to manage and optimize
the cluster by tracking resource utilization.
> To achieve our goal we have changed to use druid as the storage instead of HBase and
have achieved sub-second OLAP performance in our production environment for few months. 
> h3. Analysis
> Currently YARN Timeline Service only supports aggregating metrics at a) flow level by
FlowRunCoprocessor and b) application level metrics aggregating by AppLevelTimelineCollector,
offline (time-based periodic) aggregation for flows/users/queues for reporting and analysis
is planned but not yet implemented. YARN Timeline Service chooses Apache HBase as the primary
storage backend. As we all know that HBase doesn't fit for OLAP.
>  For arbitrary exploration of data,such as online analyze the utilization reports of
various dimensions(Queue,Flow,Users,Application,CPU,Memory) by joining and aggregating data,
Druid's custom column format enables ad-hoc queries without pre-computation. The format also
enables fast scans on columns, which is important for good aggregation performance.
> To achieve our goal that support to online analyze the utilization reports of various
dimensions, display the variation trends of allocation/usage resources for cluster, and arbitrary
exploration of data, we propose to add druid storage and implement DruidWriter /DruidReader
in YARN Timeline Service.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org


Mime
View raw message