Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B02B2FC5A for ; Fri, 29 Mar 2013 20:10:40 +0000 (UTC) Received: (qmail 94599 invoked by uid 500); 29 Mar 2013 20:10:35 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 94502 invoked by uid 500); 29 Mar 2013 20:10:35 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 94495 invoked by uid 99); 29 Mar 2013 20:10:35 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 29 Mar 2013 20:10:35 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jens.scheidtmann@gmail.com designates 209.85.214.179 as permitted sender) Received: from [209.85.214.179] (HELO mail-ob0-f179.google.com) (209.85.214.179) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 29 Mar 2013 20:10:30 +0000 Received: by mail-ob0-f179.google.com with SMTP id un3so598979obb.24 for ; Fri, 29 Mar 2013 13:10:09 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:x-received:in-reply-to:references:date:message-id :subject:from:to:content-type; bh=fu26i/djS9KB3BXj5wWirPm6NQ6/4vebPj5AaXxxtE8=; b=U3ey5IOwA1zwbNIribSDmg7Ysrrvmv4zLeGBaFElEoT1X0tab8AQhWuIOnIW32+wz1 lqePZQzAzD/qhKmQjCaAWTF1fxsFudtkOUF/OoLu43hZdNMzCtFhlXhobjUtV7PyEtzM 1HA5rQwMPaXbynPaqkXbPvVcN2n6e4a8DnL3KBYBRws9YutBiysgD7gT2pqR2QGPn0Cq jfby6AQcGtKfanth9s/pu4hsgr9EOgbBBHotMWVcPApXlKUpn/TGf1GS5DZhgpQ29nHm Uzuxqcrj66Hcd8y1hWKO8eUiOaMm5oFq8s/R4qZGjJA59stjTVSUUwabP+UbwleboXNq J59Q== MIME-Version: 1.0 X-Received: by 10.182.27.40 with SMTP id q8mr1230729obg.100.1364587809411; Fri, 29 Mar 2013 13:10:09 -0700 (PDT) Received: by 10.60.15.196 with HTTP; Fri, 29 Mar 2013 13:10:09 -0700 (PDT) In-Reply-To: <1364577771.12724.YahooMailNeo@web194704.mail.sg3.yahoo.com> References: <1364377874.13753.YahooMailNeo@web194703.mail.sg3.yahoo.com> <1364577771.12724.YahooMailNeo@web194704.mail.sg3.yahoo.com> Date: Fri, 29 Mar 2013 21:10:09 +0100 Message-ID: Subject: Re: Understanding Sys.output from mapper & partitioner From: Jens Scheidtmann To: user@hadoop.apache.org, Sai Sai Content-Type: multipart/alternative; boundary=089e01229762a5658f04d915db28 X-Virus-Checked: Checked by ClamAV on apache.org --089e01229762a5658f04d915db28 Content-Type: text/plain; charset=ISO-8859-1 Dear Sai Sai, you wrote: > key = 0 value = 10 10 > key = 6 value = 20 200 > ... the provided key is the byte offset of the respective line in your input file. See TextInputFormat docs here: http://hadoop.apache.org/docs/stable/api/org/apache/hadoop/mapred/TextInputFormat.html I guess this is used as default, as you didn't specify something different by using FileInputFormat... Best regards, Jens Best regards, Jens --089e01229762a5658f04d915db28 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Dear Sai Sai,

you wrote:
> key =3D 0 value =3D 10=A0=A0=A0 10
> k= ey =3D 6 value =3D 20=A0=A0=A0 200
> ...

the provided k= ey is the byte offset of the respective line in your input file.
<= div>See TextInputFormat docs here: http://hadoop.ap= ache.org/docs/stable/api/org/apache/hadoop/mapred/TextInputFormat.html<= br>
I guess this is used as default, as you didn't specify s= omething different by using FileInputFormat...

Best regar= ds,

Jens





Best reg= ards,

Jens
--089e01229762a5658f04d915db28--