Subject: Re: improve performance of a MapReduce job with HBase input
From: Ey-Chih chow
Date: Fri, 25 May 2012 13:49:19 -0700
To: user@hbase.apache.org

Thanks. This helps.

Ey-Chih Chow

On May 25, 2012, at 11:23 AM, Dave Revell wrote:

> Here's what I do:
>
> Scan scan = new Scan(...);
> scan.setCaching(5000);
> scan.setWhatever(...);
>
> TableMapReduceUtil.initTableMapperJob(tablename, scan, mapClass,
>     mapOutKeyClass, mapOutValueClass, job);
>
> Does that help?
>
> -Dave
>
> On Fri, May 25, 2012 at 11:03 AM, Ey-Chih chow wrote:
>
>> Hi,
>>
>> We have a MapReduce job whose input data comes from HBase. We would like
>> to improve the performance of the job. According to the HBase book, we can
>> do that by setting scan caching to a number higher than the default. We use
>> TableInputFormat to read data in the job. I looked at the implementation
>> of the class. The class does not set caching when a scan object is
>> created. Does anybody know how to externally set caching for the scan
>> created in TableInputFormat? Thanks.
>>
>> Ey-Chih Chow
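
To make the effect of the scan caching value discussed in this thread concrete, here is a rough back-of-the-envelope sketch in plain Java. It does not call HBase at all. It assumes a simplified model in which each scanner fetch returns at most `caching` rows, so a mapper reading `rows` rows needs roughly ceil(rows / caching) fetches. The class name, the method name, and the figure of 1,000,000 rows per split are hypothetical and chosen only for illustration.

```java
// Simplified model only: real scans have extra overhead (opening and
// closing the scanner, a final fetch that detects the end of the range)
// that this estimate ignores.
class ScanCachingEstimate {

    // Estimated number of scanner fetches needed to read `rows` rows when
    // each fetch returns at most `caching` rows: ceil(rows / caching).
    static long estimatedFetches(long rows, int caching) {
        if (rows < 0 || caching <= 0) {
            throw new IllegalArgumentException("rows must be >= 0 and caching > 0");
        }
        return (rows + caching - 1) / caching;
    }

    public static void main(String[] args) {
        long rowsPerSplit = 1_000_000L; // hypothetical number of rows read by one mapper
        for (int caching : new int[] {1, 100, 5000}) {
            System.out.printf("caching=%d -> about %d fetches%n",
                    caching, estimatedFetches(rowsPerSplit, caching));
        }
    }
}
```

Under this model, a caching value of 1 costs about 1,000,000 fetches for the hypothetical split, while the 5000 used in the reply above costs about 200. The trade-off is that a larger value holds more rows in memory per fetch, so the best setting depends on row size and on how long the mapper spends on each batch.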