hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zhe Zhang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-11828) Implement the Hitchhiker erasure coding algorithm
Date Wed, 20 Jan 2016 06:41:40 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-11828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15108115#comment-15108115
] 

Zhe Zhang commented on HADOOP-11828:
------------------------------------

Thanks Jack for the great work, and Rashmi / Kai for the reviews! The patch and it LGTM overall.
I about about to finish and post my review. Meanwhile, could you add Apache license header
to {{HHXORErasureDecodingStep}} (see this [complain | https://builds.apache.org/job/PreCommit-HADOOP-Build/8425/artifact/patchprocess/patch-asflicense-problems.txt]),
and add a trivial {{package-info}} for now (as Kai suggested)? The purpose of the class is
for Javadoc and I do think we will add Javadoc for the new package later.

> Implement the Hitchhiker erasure coding algorithm
> -------------------------------------------------
>
>                 Key: HADOOP-11828
>                 URL: https://issues.apache.org/jira/browse/HADOOP-11828
>             Project: Hadoop Common
>          Issue Type: Sub-task
>            Reporter: Zhe Zhang
>            Assignee: jack liuquan
>         Attachments: 7715-hitchhikerXOR-v2-testcode.patch, 7715-hitchhikerXOR-v2.patch,
HADOOP-11828-hitchhikerXOR-V3.patch, HADOOP-11828-hitchhikerXOR-V4.patch, HADOOP-11828-hitchhikerXOR-V5.patch,
HADOOP-11828-hitchhikerXOR-V6.patch, HADOOP-11828-hitchhikerXOR-V7.patch, HDFS-7715-hhxor-decoder.patch,
HDFS-7715-hhxor-encoder.patch
>
>
> [Hitchhiker | http://www.eecs.berkeley.edu/~nihar/publications/Hitchhiker_SIGCOMM14.pdf]
is a new erasure coding algorithm developed as a research project at UC Berkeley. It has been
shown to reduce network traffic and disk I/O by 25%-45% during data reconstruction while retaining
the same storage capacity and failure tolerance capability as RS codes. This JIRA aims to
introduce Hitchhiker to the HDFS-EC framework, as one of the pluggable codec algorithms.
> The existing implementation is based on HDFS-RAID. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message