hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Kai Zheng (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HADOOP-12041) Implement another Reed-Solomon coder in pure Java
Date Tue, 18 Aug 2015 13:03:45 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-12041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Kai Zheng updated HADOOP-12041:
-------------------------------
    Attachment: HADOOP-12041-v1.patch

Uploaded the patch, read for review. It's running much faster than original HDFS-RAID based
Java coder (5X) and compatible with the upcoming native/ISA-L coder.

> Implement another Reed-Solomon coder in pure Java
> -------------------------------------------------
>
>                 Key: HADOOP-12041
>                 URL: https://issues.apache.org/jira/browse/HADOOP-12041
>             Project: Hadoop Common
>          Issue Type: Sub-task
>            Reporter: Kai Zheng
>            Assignee: Kai Zheng
>         Attachments: HADOOP-12041-v1.patch
>
>
> Currently existing Java RS coders based on {{GaloisField}} implementation have some drawbacks
or limitations:
> * The decoder computes not really erased units unnecessarily (HADOOP-11871);
> * The decoder requires parity units + data units order for the inputs in the decode API
(HADOOP-12040);
> * Need to support or align with native erasure coders regarding concrete coding algorithms
and matrix, so Java coders and native coders can be easily swapped in/out and transparent
to HDFS (HADOOP-12010);
> * It's unnecessarily flexible but incurs some overhead, as HDFS erasure coding is totally
a byte based data system, we don't need to consider other symbol size instead of 256.
> This desires to implement another  RS coder in pure Java, in addition to the existing
{{GaliosField}} from HDFS-RAID. The new Java RS coder will be favored and used by default
to resolve the related issues. The old HDFS-RAID originated coder will still be there for
comparing, and converting old data from HDFS-RAID systems.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message