From: "Allen Wittenauer (JIRA)"
To: hdfs-issues@hadoop.apache.org
Reply-To: hdfs-issues@hadoop.apache.org
Date: Fri, 4 Dec 2009 17:48:20 +0000 (UTC)
Message-ID: <738074360.1259948900713.JavaMail.jira@brutus>
Subject: [jira] Commented: (HDFS-808) Implement something like PAR2 support?
In-Reply-To: <1991207286.1259945060653.JavaMail.jira@brutus>

    [ https://issues.apache.org/jira/browse/HDFS-808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786026#action_12786026 ]

Allen Wittenauer commented on HDFS-808:
---------------------------------------

I'm thinking about the situation where you have the complete file except one or two blocks are completely missing (i.e., no replicas). Using something like PAR2 you'd be able to reconstruct the missing block completely.

> Implement something like PAR2 support?
> --------------------------------------
>
>                 Key: HDFS-808
>                 URL: https://issues.apache.org/jira/browse/HDFS-808
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: Allen Wittenauer
>            Priority: Minor
>
> We really need an Idea issue type, because I'm not sure if this is really viable. :) Just sort of thinking "out loud".
> I was thinking about how file recovery works on services like Usenet to fix data corruption when chunks of files are missing. I wonder how hard it would be to implement something like PAR2 [ http://en.wikipedia.org/wiki/Parchive ] automatically for large files. We'd have the advantage of being able to do it in binary of course and could likely hide the details within HDFS itself.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
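
To illustrate the recovery idea in the comment above (a file that is complete except for a block with no surviving replicas), here is a minimal sketch. It is not PAR2 and not anything HDFS provides: PAR2 uses Reed-Solomon coding and can rebuild several missing blocks, whereas this sketch swaps in a single XOR parity block, which can rebuild exactly one. The function names and the sample block contents are hypothetical, chosen only for illustration.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR a list of equal-length byte blocks together, byte by byte."""
    if len({len(b) for b in blocks}) != 1:
        raise ValueError("all blocks must have the same length")
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def make_parity(blocks):
    """Compute one parity block over all data blocks (hypothetical helper)."""
    return xor_blocks(blocks)


def recover_missing(blocks, parity):
    """Rebuild the single missing block (marked None) from survivors + parity."""
    missing = [i for i, b in enumerate(blocks) if b is None]
    if len(missing) != 1:
        # Assumption: one XOR parity block tolerates exactly one lost block.
        raise ValueError("single XOR parity can rebuild exactly one missing block")
    survivors = [b for b in blocks if b is not None]
    rebuilt = xor_blocks(survivors + [parity])
    index = missing[0]
    return blocks[:index] + [rebuilt] + blocks[index + 1:]


# Sample data: three tiny "blocks" standing in for file blocks.
data = [b"hdfs", b"blk1", b"blk2"]
parity = make_parity(data)

# Simulate losing every replica of the middle block.
damaged = [data[0], None, data[2]]
restored = recover_missing(damaged, parity)
```

Tolerating "one or two" missing blocks, as the comment describes, would need more than one parity block and a code such as Reed-Solomon; the XOR case only shows why stored redundancy lets a block be rebuilt without any replica of it.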