Return-Path: Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: (qmail 49181 invoked from network); 2 Nov 2009 15:28:20 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 2 Nov 2009 15:28:20 -0000 Received: (qmail 49436 invoked by uid 500); 2 Nov 2009 15:28:18 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 49343 invoked by uid 500); 2 Nov 2009 15:28:17 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 49333 invoked by uid 99); 2 Nov 2009 15:28:17 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 02 Nov 2009 15:28:17 +0000 X-ASF-Spam-Status: No, hits=-2.6 required=5.0 tests=BAYES_00 X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: 81.17.254.11 is neither permitted nor denied by domain of architect@galatea.com) Received: from [81.17.254.11] (HELO mail.blacknight.com) (81.17.254.11) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 02 Nov 2009 15:28:14 +0000 Received: (qmail 26668 invoked from network); 2 Nov 2009 15:27:51 -0000 Received: from unknown (HELO ?192.168.2.6?) (architect@galatea.com@[72.14.179.12]) by 81.17.254.9 with ESMTPS (DHE-RSA-AES256-SHA encrypted); 2 Nov 2009 15:27:50 -0000 Message-ID: <4AEEFA58.1090509@galatea.com> Date: Mon, 02 Nov 2009 15:27:20 +0000 From: L User-Agent: Thunderbird 2.0.0.23 (Windows/20090812) MIME-Version: 1.0 To: common-user@hadoop.apache.org Subject: Re: Multiple Input Paths References: <5D66A842901F8E41815AF6D27A28EC490A8527F4B9@Mail-Ab02.rmg-ny.com> In-Reply-To: <5D66A842901F8E41815AF6D27A28EC490A8527F4B9@Mail-Ab02.rmg-ny.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Mark, Is the structure of both files the same? It makes even more sense to combine the files, if you can, as I have seen a considerable speed up when I've done that (at least when I've had small files to deal with). Lajos Mark Vigeant wrote: > Hey, quick question: > > I'm writing a program that parses data from 2 different files and puts the data into a table. Currently I have 2 different map functions and so I submit 2 separate jobs to the job client. Would it be more efficient to add both paths to the same mapper and only submit one job? Thanks a lot! > > Mark Vigeant > RiskMetrics Group, Inc. > --