Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B6612DC40 for ; Tue, 25 Sep 2012 08:33:28 +0000 (UTC) Received: (qmail 3517 invoked by uid 500); 25 Sep 2012 08:33:24 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 3224 invoked by uid 500); 25 Sep 2012 08:33:19 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 2922 invoked by uid 99); 25 Sep 2012 08:33:17 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 25 Sep 2012 08:33:17 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=FSL_RCVD_USER,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of sigurd.spieckermann@gmail.com designates 209.85.160.48 as permitted sender) Received: from [209.85.160.48] (HELO mail-pb0-f48.google.com) (209.85.160.48) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 25 Sep 2012 08:33:10 +0000 Received: by pbbrq13 with SMTP id rq13so16920072pbb.35 for ; Tue, 25 Sep 2012 01:32:50 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=XEZ+4HS4bI0yi7r0VwOsx1RiwRjHkiiY3b6VGXg65tM=; b=R21APxWNd3FCvpgUCq2flMwtQ/SBn9zzVY/t32T2mAPiIDS9/dJ+X7NcSPXmjPj4ut iDP/bPio6VB7TYTLPPwdE5TflGzfBKFYJrDlhpOK0a4eoz2oy/tjTnBZu3Lq9StH1jBb p64iCfI5p8jbh3JoIoEciRw93yO3gUQKokjIO+vtzksJRgRrSKXFf5QbiVnw/GBhBNme ZgfoyVX+1MyGPd1NHpbUjgCPUOR//kBliW8Iu9P1+crE6kLn5uUjQzfHuHZTeatnPEOK kIYblDIM8saUCDqTjGEvRD11PKjZ7H4Bkxxl+ZIkVl0RaLRw6Q6o/cUU5SNYMkKfFzQw kfaQ== MIME-Version: 1.0 Received: by 10.68.136.40 with SMTP id px8mr43814645pbb.153.1348561970210; Tue, 25 Sep 2012 01:32:50 -0700 (PDT) Received: by 10.68.30.74 with HTTP; Tue, 25 Sep 2012 01:32:50 -0700 (PDT) Date: Tue, 25 Sep 2012 10:32:50 +0200 Message-ID: Subject: Join-package combiner number of input and output records the same From: Sigurd Spieckermann To: user@hadoop.apache.org Content-Type: multipart/alternative; boundary=047d7b15a80531272404ca828deb X-Virus-Checked: Checked by ClamAV on apache.org --047d7b15a80531272404ca828deb Content-Type: text/plain; charset=ISO-8859-1 Hi guys, I'm experiencing a strange behavior when I use the Hadoop join-package. After running a job the result statistics show that my combiner has an input of 100 records and an output of 100 records. From the task I'm running and the way it's implemented, I know that each key appears multiple times and the values should be combinable before getting passed to the reducer. I'm running my tests in pseudo-distributed mode with one or two map tasks. From using the debugger, I noticed that each key-value pair is processed by a combiner individually so there's actually no list passed into the combiner that it could aggregate. Can anyone think of a reason that causes this undesired behavior? Thanks Sigurd --047d7b15a80531272404ca828deb Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hi guys,

I'm experiencing a strange behavior when I use the Hado= op join-package. After running a job the result statistics show that my com= biner has an input of 100 records and an output of 100 records. From the ta= sk I'm running and the way it's implemented, I know that each key a= ppears multiple times and the values should be combinable before getting pa= ssed to the reducer. I'm running my tests in pseudo-distributed mode wi= th one or two map tasks. From using the debugger, I noticed that each key-v= alue pair is processed by a combiner individually so there's actually n= o list passed into the combiner that it could aggregate. Can anyone think o= f a reason that causes this undesired behavior?

Thanks
Sigurd
--047d7b15a80531272404ca828deb--