Date: Sun, 13 Feb 2011 09:09:54 -0800
Subject: Re: initial experience with HBase 0.90.1 rc0
From: Ted Yu
To: dev@hbase.apache.org
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary=20cf3054a66f093ed2049c2d00d4

--20cf3054a66f093ed2049c2d00d4
Content-Type: text/plain; charset=ISO-8859-1

Here is partial config I used:
http://pastebin.com/1Dpbb2LA

I verified that there is no hbase-0.90.1.jar in lib dir.

Thanks

On Sun, Feb 13, 2011 at 8:59 AM, Ted Yu wrote:

> BTW
> The timeout (when calling flushCommits) happened at midnight, so I didn't
> capture jstack.
>
> In hadoop1 region server log, I see this around time of timeout in 4th run:
>
> 2011-02-13 08:25:01,015 DEBUG org.apache.hadoop.hbase.regionserver.HRegion:
> Finished snapshotting, commencing flushing stores
> 2011-02-13 08:25:01,016 WARN org.apache.hadoop.ipc.HBaseServer: IPC Server
> Responder, call flushRegion(REGION => {NAME =>
> 'NIGHTLYDEVGRIDSGRIDSQL-THREEGPPSPEECHCALLS-1297583809865,2>&U\xF6\xB582>&U\xF6\xB582>&U\xF6\xB582>&U\xF6\xB582>&T,1297583814638.8cb772d452dee232306dfab0b472ec9a.',
> STARTKEY => '2>&U\xF6\xB582>&U\xF6\xB582>&U\xF6\xB582>&U\xF6\xB582>&T',
> ENDKEY =>
> '2\xC1\xA3\xDFhVz2\xC1\xA3\xDFhVz2\xC1\xA3\xDFhVz2\xC1\xA3\xDFhVz2\xC1\xA3\xDD',
> ENCODED => 8cb772d452dee232306dfab0b472ec9a, TABLE => {{NAME =>
> 'NIGHTLYDEVGRIDSGRIDSQL-THREEGPPSPEECHCALLS-1297583809865', FAMILIES =>
> [{NAME => 'd', BLOOMFILTER => 'ROW', REPLICATION_SCOPE => '0', VERSIONS =>
> '2', COMPRESSION => 'GZ', TTL => '31536000', BLOCKSIZE => '65536', IN_MEMORY
> => 'false', BLOCKCACHE => 'false'}, {NAME => 'i',
> BLOOMFILTER => 'ROW',
> REPLICATION_SCOPE => '0', VERSIONS => '2', COMPRESSION => 'GZ', TTL =>
> '31536000', BLOCKSIZE => '65536', IN_MEMORY => 'false', BLOCKCACHE =>
> 'false'}, {NAME => 'v', BLOOMFILTER => 'ROW', REPLICATION_SCOPE => '0',
> VERSIONS => '2', COMPRESSION => 'GZ', TTL => '31536000', BLOCKSIZE =>
> '65536', IN_MEMORY => 'false', BLOCKCACHE => 'false'}]}}) from
> 10.202.50.76:62489: output error
> 2011-02-13 08:25:01,020 WARN org.apache.hadoop.ipc.HBaseServer: PRI IPC
> Server handler 3 on 60020 caught: java.nio.channels.ClosedChannelException
>     at sun.nio.ch.SocketChannelImpl.ensureWriteOpen(SocketChannelImpl.java:133)
>     at sun.nio.ch.SocketChannelImpl.write(SocketChannelImpl.java:324)
>     at org.apache.hadoop.hbase.ipc.HBaseServer.channelWrite(HBaseServer.java:1339)
>     at org.apache.hadoop.hbase.ipc.HBaseServer$Responder.processResponse(HBaseServer.java:727)
>     at org.apache.hadoop.hbase.ipc.HBaseServer$Responder.doRespond(HBaseServer.java:792)
>     at org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1083)
>
> On Thu, Feb 10, 2011 at 2:41 PM, Ted Yu wrote:
>
>> I replaced hbase jar with hbase-0.90.1.jar
>> I also upgraded client side jar to hbase-0.90.1.jar
>>
>> Our map tasks were running faster than before for about 50 minutes.
>> However, map tasks then timed out calling flushCommits(). This happened even
>> after fresh restart of hbase.
>>
>> I don't see any exception in region server logs.
>>
>> In master log, I found:
>>
>> 2011-02-10 18:24:15,286 DEBUG
>> org.apache.hadoop.hbase.master.handler.OpenedRegionHandler: Opened region
>> -ROOT-,,0.70236052 on sjc1-hadoop6.X.com,60020,1297362251595
>> 2011-02-10 18:24:15,349 INFO
>> org.apache.hadoop.hbase.catalog.CatalogTracker: Failed verification of
>> .META.,,1 at address=null;
>> org.apache.hadoop.hbase.NotServingRegionException:
>> org.apache.hadoop.hbase.NotServingRegionException: Region is not online:
>> .META.,,1
>> 2011-02-10 18:24:15,350 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign:
>> master:60000-0x12e10d0e31e0000 Creating (or updating) unassigned node for
>> 1028785192 with OFFLINE state
>>
>> I am attaching the jstack of the region server (which didn't respond to
>> stop-hbase.sh).
>>
>> FYI
>>
>> On Thu, Feb 10, 2011 at 10:10 AM, Stack wrote:
>>
>>> That's probably enough, Ted. The 0.90.1 hbase-default.xml has an extra
>>> config. to enable the experimental HBASE-3455 feature but you can copy
>>> that over if you want to try playing with it (it defaults off so you'd
>>> copy over the config. if you wanted to set it to true).
>>>
>>> St.Ack
>>>
>>
>>
>
--20cf3054a66f093ed2049c2d00d4--