Return-Path: Delivered-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Received: (qmail 6928 invoked from network); 18 Jun 2010 00:39:46 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 18 Jun 2010 00:39:46 -0000 Received: (qmail 14894 invoked by uid 500); 18 Jun 2010 00:39:46 -0000 Delivered-To: apmail-hadoop-mapreduce-issues-archive@hadoop.apache.org Received: (qmail 14868 invoked by uid 500); 18 Jun 2010 00:39:46 -0000 Mailing-List: contact mapreduce-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-issues@hadoop.apache.org Delivered-To: mailing list mapreduce-issues@hadoop.apache.org Received: (qmail 14860 invoked by uid 99); 18 Jun 2010 00:39:45 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Jun 2010 00:39:45 +0000 X-ASF-Spam-Status: No, hits=-1527.0 required=10.0 tests=ALL_TRUSTED,AWL,FS_REPLICA X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Jun 2010 00:39:45 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o5I0dOvm017980 for ; Fri, 18 Jun 2010 00:39:25 GMT Message-ID: <12722991.70621276821564875.JavaMail.jira@thor> Date: Thu, 17 Jun 2010 20:39:24 -0400 (EDT) From: "Scott Chen (JIRA)" To: mapreduce-issues@hadoop.apache.org Subject: [jira] Updated: (MAPREDUCE-1831) Delete the co-located replicas when raiding file In-Reply-To: <11474509.111401275418905110.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/MAPREDUCE-1831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Scott Chen updated MAPREDUCE-1831: ---------------------------------- Status: Patch Available (was: Open) I think the failed contrib test may be TestSimulatorDeterministicReplay.testMain. It is a know issue in MAPREDUCE-1834. But the testReport is gone. I am submitting this to hudson again. > Delete the co-located replicas when raiding file > ------------------------------------------------ > > Key: MAPREDUCE-1831 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-1831 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: contrib/raid > Affects Versions: 0.22.0 > Reporter: Scott Chen > Assignee: Scott Chen > Fix For: 0.22.0 > > Attachments: MAPREDUCE-1831.20100610.txt, MAPREDUCE-1831.txt, MAPREDUCE-1831.v1.1.txt > > > In raid, it is good to have the blocks on the same stripe located on different machine. > This way when one machine is down, it does not broke two blocks on the stripe. > By doing this, we can decrease the block error probability in raid from O(p^3) to O(p^4) which can be a hugh improvement (where p is the replica missing probability). > One way to do this is that we can add a new BlockPlacementPolicy which deletes the replicas that are co-located. > So when raiding the file, we can make the remaining replicas live on different machines. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.