Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9FACFDC50 for ; Wed, 19 Dec 2012 00:10:11 +0000 (UTC) Received: (qmail 47276 invoked by uid 500); 19 Dec 2012 00:10:09 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 47014 invoked by uid 500); 19 Dec 2012 00:10:09 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 47006 invoked by uid 99); 19 Dec 2012 00:10:09 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 19 Dec 2012 00:10:09 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of orlenko.stas@gmail.com designates 209.85.212.49 as permitted sender) Received: from [209.85.212.49] (HELO mail-vb0-f49.google.com) (209.85.212.49) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 19 Dec 2012 00:10:01 +0000 Received: by mail-vb0-f49.google.com with SMTP id r6so1638630vbi.8 for ; Tue, 18 Dec 2012 16:09:40 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=uYDp+S02JXAiyNof752IH/wYP4/mB+1YU92Bhx09v/o=; b=rQFDu7qfOgpaoAdvUivQKzSL389E/Tnz8s4BDdBePXzVChPCOpyIdmY+M+J4WUfLwl TaNY0N86O2Aw45AAccLS0zYRcsX9+LAg37mH5CkSLiM4vqA8c/pluJYk9xXMqkvp9wtS rtYBPN9LG6CCZ4rwo1auMynOvJ13VFjWhaewwnkF+9yHqrgLDxQY0qUKsH4AYhEtsKJD xgRLw1uTOOUJK0rnAxieux0LTnZZg49ykXUMOeEDHeDZKYMlp306BXMDoeakzPObQOi9 8F668pbZzsnVH7MaD1R5/jQ1y44Ibn4LACj/TEAeZpO00NNJFO7pqmuS5TCtE0AOwEFi +fNg== MIME-Version: 1.0 Received: by 10.52.33.143 with SMTP id r15mr5267215vdi.78.1355875779908; Tue, 18 Dec 2012 16:09:39 -0800 (PST) Received: by 10.220.117.83 with HTTP; Tue, 18 Dec 2012 16:09:39 -0800 (PST) In-Reply-To: References: Date: Wed, 19 Dec 2012 02:09:39 +0200 Message-ID: Subject: Re: checksum exception From: Stanislav Orlenko To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=20cf3077692738940c04d1296e03 X-Virus-Checked: Checked by ClamAV on apache.org --20cf3077692738940c04d1296e03 Content-Type: text/plain; charset=ISO-8859-1 In logs also I see exceptions like: 2012-12-19 01:40:45,320 ERROR org.apache.hadoop.hbase.regionserver.HRegionServer: java.lang.IllegalArgumentException: offset (65577) + length (2) exceed the capacity of the array: 65577 at org.apache.hadoop.hbase.util.Bytes.explainWrongLengthOrOffset(Bytes.java:506) at org.apache.hadoop.hbase.util.Bytes.toShort(Bytes.java:728) at org.apache.hadoop.hbase.util.Bytes.toShort(Bytes.java:714) at org.apache.hadoop.hbase.KeyValue.getRowLength(KeyValue.java:733) at org.apache.hadoop.hbase.KeyValue.getRow(KeyValue.java:894) at org.apache.hadoop.hbase.regionserver.HRegion$RegionScanner.peekRow(HRegion.java:2523) at org.apache.hadoop.hbase.regionserver.HRegion$RegionScanner.nextInternal(HRegion.java:2454) at org.apache.hadoop.hbase.regionserver.HRegion$RegionScanner.next(HRegion.java:2425) at org.apache.hadoop.hbase.regionserver.HRegion$RegionScanner.next(HRegion.java:2442) at org.apache.hadoop.hbase.regionserver.HRegionServer.next(HRegionServer.java:1863) at sun.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.apache.hadoop.hbase.ipc.HBaseRPC$Server.call(HBaseRPC.java:570) at org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1039) I am not sure about hadoop version, in libs directory I see hadoop-core-0.20-append-r1056497.jar We use hbase as a backend for Gora in Nutch 2. Gora 0.2 uses HBase 0.90.X. That is why we use hbase 0.90.6. By the way in nutch's lib directory I see hadoop-core-1.0.3.jar Strange thing. After reboot it works without the exception for some time. Although after some period of time the checksum exception appears again. For example after previous reboot it has worked for about 24 hours, but now I see this exception again On Sun, Dec 16, 2012 at 2:44 AM, Ted Yu wrote: > The exception came from FSInputChecker.verifySum(). > > Can you check Namenode log around the time the error happened to find more > information ? > What hadoop version do you use ? > > BTW 0.90.6 is really old. I suggest upgrading to 0.92 or 0.94 > > Cheers > > On Sat, Dec 15, 2012 at 4:13 PM, Stanislav Orlenko > wrote: > > > Hello > > We use HBase 0.90.6 and have the problem: > > > > org.apache.hadoop.hbase.client.RetriesExhaustedException: Trying to > contact > > region server marv.site:56463 for region > > > > > webpage,pl.allegro:http/stare-rozdzielacze-i2839591826.html,1355478823557.80e5ed6af22f12d45eabae89537f8602., > > row 'pl.allegro:http/stare-rozdzielacze-i2839591826.html', but failed > after > > 10 attempts. > > Exceptions: > > java.io.IOException: java.io.IOException: Could not iterate > > StoreFileScanner[HFileScanner for reader > > > > > reader=file:/home/stas/pricex/hbase/webpage/80e5ed6af22f12d45eabae89537f8602/f/5763919690181742358, > > compression=none, inMemory=false, > > > > > firstKey=pl.allegro:http/stare-rozdzielacze-i2839591826.html/f:bas/1355311168483/Put, > > lastKey=pl.gumtree.lodz:http/p-TermsAndConditions/f:ts/1355360972551/Put, > > avgKeyLen=83, avgValueLen=2880, entries=64207, length=191013462, > > > > > cur=pl.allegro:http/swiat-mebli-komoda-srebrna-61980-k14-i2787252255.html/f:cnt/1355480654644/Put/vlen=62169] > > at > > > > > org.apache.hadoop.hbase.regionserver.StoreFileScanner.next(StoreFileScanner.java:89) > > at > > > > > org.apache.hadoop.hbase.regionserver.KeyValueHeap.next(KeyValueHeap.java:82) > > at > > > > > org.apache.hadoop.hbase.regionserver.StoreScanner.next(StoreScanner.java:262) > > at > > > > > org.apache.hadoop.hbase.regionserver.KeyValueHeap.next(KeyValueHeap.java:114) > > at > > > > > org.apache.hadoop.hbase.regionserver.HRegion$RegionScanner.nextInternal(HRegion.java:2469) > > at > > > > > org.apache.hadoop.hbase.regionserver.HRegion$RegionScanner.next(HRegion.java:2425) > > at > > > > > org.apache.hadoop.hbase.regionserver.HRegion$RegionScanner.next(HRegion.java:2442) > > at > > > > > org.apache.hadoop.hbase.regionserver.HRegionServer.next(HRegionServer.java:1863) > > at sun.reflect.GeneratedMethodAccessor6.invoke(Unknown Source) > > at > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) > > at java.lang.reflect.Method.invoke(Method.java:597) > > at > > org.apache.hadoop.hbase.ipc.HBaseRPC$Server.call(HBaseRPC.java:570) > > at > > > org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1039) > > Caused by: org.apache.hadoop.fs.ChecksumException: Checksum error: > > > > > file:/home/stas/pricex/hbase/webpage/80e5ed6af22f12d45eabae89537f8602/f/5763919690181742358 > > at 28099584 > > at > > org.apache.hadoop.fs.FSInputChecker.verifySum(FSInputChecker.java:277) > > at > > > > > org.apache.hadoop.fs.FSInputChecker.readChecksumChunk(FSInputChecker.java:241) > > at > > org.apache.hadoop.fs.FSInputChecker.read1(FSInputChecker.java:189) > > at > > org.apache.hadoop.fs.FSInputChecker.read(FSInputChecker.java:158) > > at java.io.DataInputStream.read(DataInputStream.java:132) > > at > > > > > org.apache.hadoop.hbase.io.hfile.BoundedRangeFileInputStream.read(BoundedRangeFileInputStream.java:105) > > at > java.io.BufferedInputStream.read1(BufferedInputStream.java:256) > > at java.io.BufferedInputStream.read(BufferedInputStream.java:317) > > at org.apache.hadoop.io.IOUtils.readFully(IOUtils.java:102) > > at > > org.apache.hadoop.hbase.io.hfile.HFile$Reader.decompress(HFile.java:1094) > > at > > org.apache.hadoop.hbase.io.hfile.HFile$Reader.readBlock(HFile.java:1036) > > at > > > org.apache.hadoop.hbase.io.hfile.HFile$Reader$Scanner.next(HFile.java:1276) > > at > > > > > org.apache.hadoop.hbase.regionserver.StoreFileScanner.next(StoreFileScanner.java:87) > > ... 12 more > > > > java.io.IOException: java.io.IOException: > > java.lang.IllegalArgumentException > > at > > > > > org.apache.hadoop.hbase.regionserver.HRegionServer.convertThrowableToIOE(HRegionServer.java:997) > > at > > > > > org.apache.hadoop.hbase.regionserver.HRegionServer.convertThrowableToIOE(HRegionServer.java:986) > > at > > > > > org.apache.hadoop.hbase.regionserver.HRegionServer.next(HRegionServer.java:1887) > > at sun.reflect.GeneratedMethodAccessor6.invoke(Unknown Source) > > at > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) > > at java.lang.reflect.Method.invoke(Method.java:597) > > at > > org.apache.hadoop.hbase.ipc.HBaseRPC$Server.call(HBaseRPC.java:570) > > at > > > org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1039) > > Caused by: java.lang.IllegalArgumentException > > at java.nio.Buffer.position(Buffer.java:218) > > at > > > org.apache.hadoop.hbase.io.hfile.HFile$Reader$Scanner.next(HFile.java:1266) > > at > > > > > org.apache.hadoop.hbase.regionserver.StoreFileScanner.next(StoreFileScanner.java:87) > > at > > > > > org.apache.hadoop.hbase.regionserver.KeyValueHeap.next(KeyValueHeap.java:82) > > at > > > > > org.apache.hadoop.hbase.regionserver.StoreScanner.next(StoreScanner.java:262) > > at > > > > > org.apache.hadoop.hbase.regionserver.KeyValueHeap.next(KeyValueHeap.java:114) > > at > > > > > org.apache.hadoop.hbase.regionserver.HRegion$RegionScanner.nextInternal(HRegion.java:2469) > > at > > > > > org.apache.hadoop.hbase.regionserver.HRegion$RegionScanner.next(HRegion.java:2425) > > at > > > > > org.apache.hadoop.hbase.regionserver.HRegion$RegionScanner.next(HRegion.java:2442) > > at > > > > > org.apache.hadoop.hbase.regionserver.HRegionServer.next(HRegionServer.java:1863) > > ... 5 more > > > > We do not use special configuration. Only hbase.rootdir and > > hbase.zookeeper.property.dataDir are specified in hbase-site. > > What can be a reason for such behavior? > > > > Thanks in advance > > > --20cf3077692738940c04d1296e03--