hadoop-yarn-issues mailing list archives

From "Nam H. Do (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-2681) Support bandwidth enforcement for containers while reading from HDFS
Date Sat, 04 Jul 2015 13:14:05 GMT

    [ https://issues.apache.org/jira/browse/YARN-2681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14613804#comment-14613804
] 

Nam H. Do commented on YARN-2681:
---------------------------------

+ The release audit and findbugs warnings are reported incorrectly because the trunk build of
hadoop-mapreduce-client-app was broken in the pre-patch phase. In fact, hs_err_pid3335.log
was generated during that build (see trunkFindBugsOutputhadoop-mapreduce-client-app.txt),
and because the build was broken, the pre-patch run could not detect the related findbugs
warning (this warning belongs to the NodeLabelExpression feature!).

+ The failed tests are not related to the patch. I verified that all of these tests pass in
our local environment after merging with the latest trunk (Java 1.7.0_80 HotSpot(TM) 64-Bit
Server VM (build 24.80-b11, mixed mode)/Linux xcp116 3.13.0-55-generic #94-Ubuntu SMP Thu
Jun 18 00:27:10 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux).

> Support bandwidth enforcement for containers while reading from HDFS
> --------------------------------------------------------------------
>
>                 Key: YARN-2681
>                 URL: https://issues.apache.org/jira/browse/YARN-2681
>             Project: Hadoop YARN
>          Issue Type: New Feature
>          Components: nodemanager
>    Affects Versions: 2.5.1
>         Environment: Linux
>            Reporter: Nam H. Do
>              Labels: BB2015-05-TBR
>             Fix For: 2.7.0
>
>         Attachments: Traffic Control Design.png, YARN-2681.001.patch, YARN-2681.002.patch,
YARN-2681.patch
>
>
> To read/write data from HDFS on a data node, applications establish TCP/IP connections
with the datanode. HDFS reads can be controlled by configuring the Linux Traffic Control (TC)
subsystem on the data node to apply filters to the appropriate connections.
> The current cgroups net_cls concept cannot be applied on the node where the container
is launched, nor on the data node, since:
> -   TC handles outgoing bandwidth only, so it cannot be set on the container node (HDFS read
= incoming data for the container)
> -   Since the HDFS data node runs as a single process, it is not possible to use net_cls
to separate connections from different containers to the datanode.
> Tasks:
> 1) Extend Resource model to define bandwidth enforcement rate
> 2) Monitor TCP/IP connections established by the container handling process and its child
processes
> 3) Set Linux Traffic Control rules on the data node based on address:port pairs in order to
enforce the bandwidth of outgoing data
> Concept: http://www.hit.bme.hu/~do/papers/EnforcementDesign.pdf
> Implementation: 
> http://www.hit.bme.hu/~dohoai/documents/HdfsTrafficControl.pdf
> http://www.hit.bme.hu/~dohoai/documents/HdfsTrafficControl_UML_diagram.png
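
For illustration only, task 3 above could be sketched roughly as follows. This is not taken from the
attached patches. The function name, the device name, and the class numbering are hypothetical, and
the sketch assumes a root HTB qdisc with handle 1: already exists on the data node's interface. It
only builds the tc command strings for one container's connections and does not execute them.

```python
from typing import List, Tuple


def build_tc_commands(dev: str, class_minor: int, rate_mbit: int,
                      connections: List[Tuple[str, int]]) -> List[str]:
    """Build tc commands that limit data node traffic sent to one container.

    Assumption: `tc qdisc add dev <dev> root handle 1: htb` has already been run.
    `connections` holds the container-side (ip, port) pairs found by monitoring.
    """
    # tc interprets class ids as hexadecimal, so format the minor number as hex.
    classid = f"1:{class_minor:x}"
    # One HTB class per container carries the enforced rate.
    commands = [
        f"tc class add dev {dev} parent 1: classid {classid} htb rate {rate_mbit}mbit"
    ]
    for ip, port in connections:
        # Outgoing packets from the data node have the container's address and
        # port as their destination, so the filter matches on dst and dport.
        commands.append(
            f"tc filter add dev {dev} protocol ip parent 1: prio 1 u32 "
            f"match ip dst {ip}/32 match ip dport {port} 0xffff flowid {classid}"
        )
    return commands


if __name__ == "__main__":
    for cmd in build_tc_commands("eth0", 10, 50, [("10.0.0.5", 40123)]):
        print(cmd)
```

Each monitored connection gets its own filter while all connections of a container share one class,
so the rate applies to the container as a whole rather than to each connection.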



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
