Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id BA4F46CD2 for ; Fri, 20 May 2011 16:07:24 +0000 (UTC) Received: (qmail 42405 invoked by uid 500); 20 May 2011 16:07:22 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 42271 invoked by uid 500); 20 May 2011 16:07:22 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 42262 invoked by uid 99); 20 May 2011 16:07:22 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 20 May 2011 16:07:22 +0000 X-ASF-Spam-Status: No, hits=3.7 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,RFC_ABUSE_POST,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of markq2011@gmail.com designates 74.125.83.176 as permitted sender) Received: from [74.125.83.176] (HELO mail-pv0-f176.google.com) (74.125.83.176) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 20 May 2011 16:07:16 +0000 Received: by pve37 with SMTP id 37so2535252pve.35 for ; Fri, 20 May 2011 09:06:56 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:in-reply-to:references:date :message-id:subject:from:to:content-type; bh=NpadwrwTrrfV1rT6t/CqAeHaBGU8DeCsf3luBLzpKUE=; b=GcGhr1tk158u6ZKtz7VjTZ9zv84OmJPaTNLZb4AN1fTxZ7pZz0yvjuUTEZ4VsiaASY h8KqpehbbXwkBuQPkrrAxHOP9nJvTXVAgbcs7RsSYHdhi4w63fFiJMyEwiZqjgsLTA4d qd1YIDcv9YT7bOqMUqgFfEEaD1eMForM3DVTs= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=Y7HRZkAT69sIkSjsuM5lslJqCKI1a1TaHY3fxATWjsLlR5DEz4kl9JvJUP0cjlfJnv kdG8xKzJybqeF2AM0BxskK/M+9WyQQAQljRl+3peZptKaZbMhp5ahyuBSLC/USUQk1GB VFtdOQXhqKaR0klckFJXgayWe/xG+QsEJLZck= MIME-Version: 1.0 Received: by 10.142.234.14 with SMTP id g14mr1882690wfh.271.1305907616501; Fri, 20 May 2011 09:06:56 -0700 (PDT) Received: by 10.142.120.33 with HTTP; Fri, 20 May 2011 09:06:56 -0700 (PDT) In-Reply-To: References: Date: Fri, 20 May 2011 09:06:56 -0700 Message-ID: Subject: Re: outputCollector vs. Localfile From: Mark question To: common-user@hadoop.apache.org Content-Type: multipart/alternative; boundary=000e0cd313509751f604a3b74f49 --000e0cd313509751f604a3b74f49 Content-Type: text/plain; charset=ISO-8859-1 I thought it was, because of FileBytesWritten counter. Thanks for the clarification. Mark On Fri, May 20, 2011 at 4:23 AM, Harsh J wrote: > Mark, > > On Fri, May 20, 2011 at 10:17 AM, Mark question > wrote: > > This is puzzling me ... > > > > With a mapper producing output of size ~ 400 MB ... which one is > supposed > > to be faster? > > > > 1) output collector: which will write to local file then copy to HDFS > since > > I don't have reducers. > > A regular map-only job does not write to the local FS, it writes to > the HDFS directly (i.e., a local DN if one is found). > > -- > Harsh J > --000e0cd313509751f604a3b74f49--