From: Ioakim Perros
Date: Tue, 24 Jul 2012 05:08:27 +0300
To: user@hbase.apache.org
Subject: Re: Efficient read/write - Iterative M/R jobs

Update (for anyone who ends up here after searching for this issue):

It turns out that running an M/R job to bulk import data in pseudo-distributed mode is feasible (for testing purposes). The error concerning TotalOrderPartitioner had something to do with a trivial bug in the keys I was emitting from my mappers.

What you do need is to add "guava-r09.jar" (or, I suppose, any recent version of guava; it is located under the lib folder of the HBase installation) to the lib folder of the Hadoop installation.

I suppose that for the same job to run in a truly distributed environment, one has to add -libjars /path/to/guava.jar to the options of the hadoop jar command.

On 07/24/2012 02:06 AM, Jean-Daniel Cryans wrote:
>> ... INFO mapred.JobClient: Task Id : attempt_201207232344_0001_m_000000_0,
>> Status : FAILED
>> java.lang.IllegalArgumentException: *Can't read partitions file*
>> at
>> org.apache.hadoop.hbase.mapreduce.hadoopbackport.TotalOrderPartitioner.setConf(TotalOrderPartitioner.java:111)
>> ...
>>
>> I followed this link, while googling for the solution :
>> http://hbase.apache.org/book/trouble.mapreduce.html
>> and it implies a misconfiguration concerning a fully distributed
>> environment.
>>
>> I would like, therefore, to ask if it is even possible to bulk import data
>> in a pseudo-distributed mode and if this is the case, does anyone have a
>> guess about this error?
> AFAIK you just can't use the local job tracker for this, so you do
> need to start one.
>
> J-D
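
For reference, the two guava options from the update at the top of this message can be sketched as the dry run below, which only prints the commands rather than executing them. This is a rough sketch under assumptions: the job jar name, the driver class, the input/output arguments and the default install paths are hypothetical placeholders that do not come from this thread. Keep in mind that -libjars is a Hadoop generic option, so it only takes effect if the driver parses generic options (for example by running through ToolRunner), and it must appear before the job's own arguments.

```shell
#!/bin/sh
# Dry run: prints the commands instead of executing them.
# ASSUMPTION: the default paths below are hypothetical; point them at your installs.
HBASE_HOME="${HBASE_HOME:-/opt/hbase}"
HADOOP_HOME="${HADOOP_HOME:-/opt/hadoop}"

# guava ships under the HBase lib folder (guava-r09.jar in the setup described above).
GUAVA_JAR="$HBASE_HOME/lib/guava-r09.jar"

# Option 1 (pseudo-distributed): copy guava into the Hadoop lib folder.
copy_cmd() {
    printf 'cp %s %s\n' "$GUAVA_JAR" "$HADOOP_HOME/lib/"
}

# Option 2 (fully distributed): ship guava with the job via -libjars.
# Arguments: job jar, driver class, input path, output path (all hypothetical).
# -libjars goes after the driver class and before the job's own arguments.
libjars_cmd() {
    printf 'hadoop jar %s %s -libjars %s %s %s\n' "$1" "$2" "$GUAVA_JAR" "$3" "$4"
}

copy_cmd
libjars_cmd my-bulkload.jar com.example.BulkLoadDriver /input /output
```

Once the printed commands match your setup, run them by hand.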