Return-Path: Delivered-To: apmail-hbase-user-archive@www.apache.org Received: (qmail 48298 invoked from network); 10 Dec 2010 07:01:05 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 10 Dec 2010 07:01:05 -0000 Received: (qmail 90481 invoked by uid 500); 10 Dec 2010 07:01:04 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 90253 invoked by uid 500); 10 Dec 2010 07:01:03 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 90244 invoked by uid 500); 10 Dec 2010 07:01:03 -0000 Delivered-To: apmail-hadoop-hbase-user@hadoop.apache.org Received: (qmail 90240 invoked by uid 99); 10 Dec 2010 07:01:03 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Dec 2010 07:01:03 +0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=FREEMAIL_FROM,SPF_HELO_PASS,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of lists@nabble.com designates 216.139.236.158 as permitted sender) Received: from [216.139.236.158] (HELO kuber.nabble.com) (216.139.236.158) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Dec 2010 07:00:58 +0000 Received: from isper.nabble.com ([192.168.236.156]) by kuber.nabble.com with esmtp (Exim 4.63) (envelope-from ) id 1PQwy6-0000v0-Bn for hbase-user@hadoop.apache.org; Thu, 09 Dec 2010 23:00:38 -0800 Message-ID: <30423902.post@talk.nabble.com> Date: Thu, 9 Dec 2010 23:00:38 -0800 (PST) From: rajgopalv To: hbase-user@hadoop.apache.org Subject: Re: Zoo keeper exception in the middle of MR In-Reply-To: <30423879.post@talk.nabble.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Nabble-From: raja.fire@gmail.com References: <30396344.post@talk.nabble.com> <30412978.post@talk.nabble.com> <30415615.post@talk.nabble.com> <30423879.post@talk.nabble.com> OOPS! in some forum pages, the XML tags created some problem.. http://pastebin.com/2wGdswft so here's my previous reply [http://pastebin.com/2wGdswft ].. sorry for the trouble. :( rajgopalv wrote: > > Suraj, > > Hbase works when i work with smaller clusters, so i dont think hbase is > the problem. But Now i'm trying to include conf directory in classpath and > try again. > > But please tell me this, I dont find any proper documentation for starting > hbase in fully distributed mode. > > So please help me : > hbase-site.xml [master & slave] > > > > hbase.rootdir > hdfs://master.hadoopcluster:9000/hbase > The directory shared by region > servers. > > > hbase.cluster.distributed > true > > > hbase.zookeeper.quorum > > master.hadoopcluster,slave1.hadoopcluster,slave2.hadoopcluster > > > hbase.zookeeper.property.clientPort > 2181 > > > hbase.tmp.dir > /home/user/space/hbase-${user.name} > > > hbase.zookeeper.property.dataDir > ${hbase.tmp.dir}/zookeeper > > > > > ======== > > the regionservers file [master] > master.hadoopcluster > slave1.hadoopcluster > slave2.hadoopcluster > > the regionservers file [slave1] > slave1.hadoopcluster > > the regionservers file [slave2] > slave2.hadoopcluster > > > ======================================================== > > netstat -ane | grep java > > showed me : > > tcp 0 0 ::ffff:172.21.203.112:2181 > ::ffff:172.21.203.112:14271 ESTABLISHED 4850/java > > its my local IP, not 127.0.0.1 .. i hope that is okay.!? > > > rajgopalv wrote: >> >>>>From the logs, it looks like you don't have hbase conf directory in the >> classpath. Can you recheck? Also - in what mode are you running hbase? >> Fully >> distributed? If so, is zookeeper running locally (localhost:2181). >> >> My guess is that you are missing the hbase conf directory in your >> classpath. >> --Suraj >> >> >> Ted, >> >> For small data it works fine.! >> >> I tried reading 100 rows from a CSV and inserted into hbase, it worked. >> Now 15Million rows is not working. Stuck with this really bad.!!! %-| >> >> >> Ted Dunning-2 wrote: >>> >>> Very small clusters are often problematic but your logs look like your >>> cluster has something really hosey going on beyond just process going >>> missing for a time. I don't know what it is, off-hand, but it is ugly. >>> Approaching this cold, I would not assume >>> that anything is correct. Thus I would look at network configuration, >>> DNS >>> and other simple things. >>> >>> Can you run small test jobs correctly or does everything mess up? >>> >>> On Wed, Dec 8, 2010 at 8:26 PM, rajgopalv wrote: >>> >>>> >>>> Ted, >>>> >>>> I've tried incrementing my own counter in every map job, but this keep >>>> happening. >>>> Kindly look at the log here http://pastebin.com/Xv76mXDJ >>>> http://pastebin.com/Xv76mXDJ >>>> >>>> One more question, >>>> I have a small cluster of small computers now. Cluster contains 2 >>>> machines, >>>> each of 2GB ram, dual core. but i've increased the hadoop and hbase >>>> heapsize >>>> to 1.5 gb. will this create any problem ? (other than slowing down the >>>> process, i dont think this will lead to errors like what is in the log >>>> that >>>> i've given above) >>>> >>>> >>>> Ted Dunning-2 wrote: >>>> > >>>> > lt looks like your task took a long time to complete (> 10 minutes) >>>> and >>>> > didn't produce any output or report any status to Hadoop during this >>>> time. >>>> > >>>> > This often happens during indexing tasks where a reducer or mapper >>>> builds >>>> > some off-line data structure for a long time. Can you force your >>>> mappers >>>> > to >>>> > update a Hadoop counter as they go along? That might be all that is >>>> > needed. >>>> > >>>> > On Tue, Dec 7, 2010 at 5:37 AM, rajgopalv >>>> wrote: >>>> > >>>> >> Task attempt_201012071646_0001_m_000025_0 failed to report status >>>> for >>>> 600 >>>> >> seconds. Killing! >>>> >> >>>> > >>>> > >>>> >>>> -- >>>> View this message in context: >>>> http://old.nabble.com/Zoo-keeper-exception-in-the-middle-of-MR-tp30396344p30412978.html >>>> Sent from the HBase User mailing list archive at Nabble.com. >>>> >>>> >>> >>> >> >> > > -- View this message in context: http://old.nabble.com/Zoo-keeper-exception-in-the-middle-of-MR-tp30396344p30423902.html Sent from the HBase User mailing list archive at Nabble.com.