From: anil gupta
Date: Sun, 25 Aug 2013 21:08:30 -0700
Subject: Re: Mapper and Reducer takes longer than usual for a HBase table aggregation task
To: "common-user@hadoop.apache.org"

Hi Pavan,

Is this a standalone cluster? How many RegionServers (RS) are you running? What are you trying to achieve in MR? Have you tried increasing scanner caching?
"Slow" is hard to diagnose unless we know some more details of your setup.

~Anil
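
To make the scanner-caching suggestion concrete, here is a rough sketch of the arithmetic behind it. It assumes a simplified model in which each scanner RPC returns at most `caching` rows; the function name and the row count are hypothetical and not taken from the thread. In the HBase Java client the corresponding setting is `Scan.setCaching(int)`.

```python
import math


def scanner_round_trips(total_rows: int, caching: int) -> int:
    """Approximate number of scanner RPCs needed to read total_rows
    when each RPC returns at most `caching` rows (simplified model)."""
    if caching < 1:
        raise ValueError("caching must be >= 1")
    return math.ceil(total_rows / caching)


if __name__ == "__main__":
    # Hypothetical table size, chosen only for illustration.
    rows = 1_000_000
    for caching in (1, 100, 500):
        print(f"caching={caching}: {scanner_round_trips(rows, caching)} round trips")
```

A larger caching value trades client and RegionServer memory for fewer round trips, so the right value depends on row size and on how long each map call takes per row.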



On Sun, Aug 25, 2013 at 5:52 PM, 李洪忠 <lhztop@hotmail.com> wrote:
You need to post your map code here so we can analyze the question. Generally, when running MapReduce over HBase, a scanner with filter(s) is used, so the mapper count equals the number of regions in your HBase table.
As for why your reduce is so slow, I guess you have a disastrous join on the three tables, which produces too many rows.
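
The point about mapper count can be sketched as follows. This is a simplified model, not HBase's actual code: it assumes the table input format produces one input split (and so one map task) per region that overlaps the scan's row range. The region boundaries and the helper name `splits_for_scan` are made up for illustration.

```python
def splits_for_scan(regions, scan_start=b"", scan_stop=b""):
    """Return one (start, stop) split per region overlapping the scan range.

    regions: list of (start_key, end_key) byte pairs, sorted by start key.
    An empty key (b"") means "open-ended" on that side.
    """
    splits = []
    for start, end in regions:
        # The region overlaps the scan if the scan starts before the region
        # ends and the region starts before the scan stops.
        scan_starts_before_region_end = end == b"" or scan_start < end
        region_starts_before_scan_stop = scan_stop == b"" or start < scan_stop
        if not (scan_starts_before_region_end and region_starts_before_scan_stop):
            continue
        split_start = max(start, scan_start)  # b"" sorts before every key
        if end == b"":
            split_stop = scan_stop
        elif scan_stop == b"":
            split_stop = end
        else:
            split_stop = min(end, scan_stop)
        splits.append((split_start, split_stop))
    return splits


if __name__ == "__main__":
    one_region = [(b"", b"")]
    three_regions = [(b"", b"g"), (b"g", b"p"), (b"p", b"")]
    print(len(splits_for_scan(one_region)), "split(s) for a single-region table")
    print(len(splits_for_scan(three_regions)), "split(s) for a three-region table")
```

Under this model, a table that has never been split or pre-split has a single region, which would explain a job reporting exactly one mapper.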

On 2013/8/26 4:36, Pavan Sudheendra wrote:

Another question: why does it indicate the number of mappers as 1? Can I change it so that multiple mappers perform the computation?




--
Thanks & Regards,
Anil Gupta