Return-Path: Delivered-To: apmail-hadoop-mapreduce-dev-archive@minotaur.apache.org Received: (qmail 62051 invoked from network); 26 May 2010 21:12:03 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 26 May 2010 21:12:03 -0000 Received: (qmail 40495 invoked by uid 500); 26 May 2010 21:12:03 -0000 Delivered-To: apmail-hadoop-mapreduce-dev-archive@hadoop.apache.org Received: (qmail 40433 invoked by uid 500); 26 May 2010 21:12:03 -0000 Mailing-List: contact mapreduce-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-dev@hadoop.apache.org Delivered-To: mailing list mapreduce-dev@hadoop.apache.org Received: (qmail 40425 invoked by uid 99); 26 May 2010 21:12:02 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 26 May 2010 21:12:02 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 26 May 2010 21:12:00 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o4QLBdPu015480 for ; Wed, 26 May 2010 21:11:39 GMT Message-ID: <30348780.8621274908299093.JavaMail.jira@thor> Date: Wed, 26 May 2010 17:11:39 -0400 (EDT) From: "Ramkumar Vadali (JIRA)" To: mapreduce-dev@hadoop.apache.org Subject: [jira] Created: (MAPREDUCE-1819) RaidNode should submit one job per Raid policy MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org RaidNode should submit one job per Raid policy ---------------------------------------------- Key: MAPREDUCE-1819 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1819 Project: Hadoop Map/Reduce Issue Type: Improvement Components: contrib/raid Affects Versions: 0.20.1 Reporter: Ramkumar Vadali The RaidNode currently computes parity files as follows: 1. Using RaidNode.selectFiles() to figure out what files to raid for a policy 2. Using #1 repeatedly for each configured policy to accumulate a list of files. 3. Submitting a mapreduce job with the list of files from #2 using DistRaid.doDistRaid() This task addresses the fact that #2 and #3 happen sequentially. The proposal is to submit a separate mapreduce job for the list of files for each policy and use another thread to track the progress of the submitted jobs. This will help reduce the time taken for files to be raided. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.