Return-Path: Delivered-To: apmail-hadoop-mapreduce-dev-archive@minotaur.apache.org Received: (qmail 5682 invoked from network); 31 Mar 2011 21:21:46 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 31 Mar 2011 21:21:46 -0000 Received: (qmail 33227 invoked by uid 500); 31 Mar 2011 21:21:46 -0000 Delivered-To: apmail-hadoop-mapreduce-dev-archive@hadoop.apache.org Received: (qmail 33163 invoked by uid 500); 31 Mar 2011 21:21:46 -0000 Mailing-List: contact mapreduce-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-dev@hadoop.apache.org Delivered-To: mailing list mapreduce-dev@hadoop.apache.org Received: (qmail 33155 invoked by uid 99); 31 Mar 2011 21:21:46 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 31 Mar 2011 21:21:46 +0000 X-ASF-Spam-Status: No, hits=-1996.4 required=5.0 tests=ALL_TRUSTED,FS_REPLICA,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 31 Mar 2011 21:21:43 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 151BA8C65E for ; Thu, 31 Mar 2011 21:21:06 +0000 (UTC) Date: Thu, 31 Mar 2011 21:21:06 +0000 (UTC) From: "Scott Chen (JIRA)" To: mapreduce-dev@hadoop.apache.org Message-ID: <752066171.25780.1301606466083.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Resolved] (MAPREDUCE-1861) Raid should rearrange the replicas while raiding MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/MAPREDUCE-1861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Scott Chen resolved MAPREDUCE-1861. ----------------------------------- Resolution: Won't Fix We found that this approach puts some load on namenode. We will go with the approach in MAPREDUCE-2275. > Raid should rearrange the replicas while raiding > ------------------------------------------------ > > Key: MAPREDUCE-1861 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-1861 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: contrib/raid > Affects Versions: 0.23.0 > Reporter: Scott Chen > Assignee: Scott Chen > Fix For: 0.23.0 > > Attachments: MAPREDUCE-1861-v2.txt, MAPREDUCE-1861-v3.txt, MAPREDUCE-1861-v4.txt, MAPREDUCE-1861.txt, MAPREDUCE-1861.txt > > > Raided file introduce extra dependencies on the blocks on the same stripe. > Therefore we need a new way to place the blocks. > It is desirable that raided file satisfies the following two conditions: > a. Replicas on the same stripe should be on different machines (or racks) > b. Replicas of the same block should be on different racks > MAPREDUCE-1831 will try to delete the replicas on the same stripe and the same machine (a). > But in the mean time, it will try to maintain the number of distinct racks of one block (b). > We cannot satisfy (a) and (b) at the same time with the current logic in BlockPlacementPolicyDefault.chooseTarget(). > One choice we have is to change BlockPlacementPolicyDefault.chooseTarget(). > However, this placement is in general good for all files including the unraided ones. > It is not clear to us that we can make this good for both raided and unraided files. > So we propose this idea that when raiding the file. We create one more off-rack replica (so the replication=4 now). > Than we delete two blocks using the policy in MAPREDUCE-1831 after that (replication=2 now). > This way we can rearrange the replicas to satisfy (a) and (b) at the same time. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira