Return-Path: X-Original-To: apmail-hadoop-common-dev-archive@www.apache.org Delivered-To: apmail-hadoop-common-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 70C317682 for ; Tue, 4 Oct 2011 23:41:19 +0000 (UTC) Received: (qmail 8745 invoked by uid 500); 4 Oct 2011 23:41:18 -0000 Delivered-To: apmail-hadoop-common-dev-archive@hadoop.apache.org Received: (qmail 8698 invoked by uid 500); 4 Oct 2011 23:41:18 -0000 Mailing-List: contact common-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-dev@hadoop.apache.org Delivered-To: mailing list common-dev@hadoop.apache.org Received: (qmail 8690 invoked by uid 99); 4 Oct 2011 23:41:18 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 Oct 2011 23:41:18 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of saint.ack@gmail.com designates 209.85.216.48 as permitted sender) Received: from [209.85.216.48] (HELO mail-qw0-f48.google.com) (209.85.216.48) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 Oct 2011 23:41:09 +0000 Received: by qadb14 with SMTP id b14so1227743qad.35 for ; Tue, 04 Oct 2011 16:40:48 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:content-type :content-transfer-encoding; bh=CLVwjWTjOqgxEQc7nSq+4AM1gnkE1x/tgv8ghYLRFc4=; b=vOl7KSQQzUxXbBWIScwNOHKMtpHTYYjK1sYpinLmLJfbyioaVVGmcedxvGWJ09d/Qy ynWmUa9TWQK7fP7wM8IOWolhExej6GK/mgEvpG4Wzpgoi404kYg7D2qEOPgdBVLtWN27 XVEEmEjOJ/bV6eCrHrucKvhTIy8FVyjuT4dKg= MIME-Version: 1.0 Received: by 10.224.200.200 with SMTP id ex8mr1441121qab.379.1317771648504; Tue, 04 Oct 2011 16:40:48 -0700 (PDT) Sender: saint.ack@gmail.com Received: by 10.224.80.212 with HTTP; Tue, 4 Oct 2011 16:40:48 -0700 (PDT) In-Reply-To: References: Date: Tue, 4 Oct 2011 16:40:48 -0700 X-Google-Sender-Auth: vaKJgvmxxzROEfHvbXTIm1ojZoI Message-ID: Subject: Re: 0.20.205.0 Release Candidate 1 Testing From: Stack To: common-dev@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable On Tue, Oct 4, 2011 at 3:41 PM, Matt Foley wrote: > I am going to spin an RC2 early tomorrow. =A0Does anyone have other issue= s > they consider critical for 205.0? I've been playing with it. Recovering the lease on an open file (An HBase WAL) the length is always zero and I don't seem to be able to recover any edits from the file we writing at time of the crash: 2011-10-04 21:17:04,486 DEBUG org.apache.hadoop.hbase.regionserver.wal.HLogSplitter: Splitting hlog 34 of 34: hdfs://sv4r11s38:7000/hbase/.logs/sv4r8s38,7003,1317760866490/sv4= r8s38%3A7003.1317762914728, length=3D0 2011-10-04 21:17:04,486 INFO org.apache.hadoop.hbase.util.FSUtils: Recovering file hdfs://sv4r11s38:7000/hbase/.logs/sv4r8s38,7003,1317760866490/sv4r8s38%3A70= 03.1317762914728 2011-10-04 21:17:05,487 INFO org.apache.hadoop.hbase.util.FSUtils: Finished lease recover attempt for hdfs://sv4r11s38:7000/hbase/.logs/sv4r8s38,7003,1317760866490/sv4r8s38%3A70= 03.1317762914728 2011-10-04 21:17:05,488 WARN org.apache.hadoop.hbase.regionserver.wal.HLogSplitter: File hdfs://sv4r11s38:7000/hbase/.logs/sv4r8s38,7003,1317760866490/sv4r8s38%3A70= 03.1317762914728 might be still open, length is 0 Its probably me misconfiguring 205 compared to 0.20-append. I got some of these tooo though I'd just opened the file a few seconds earl= ier: 2011-10-04 21:16:28,439 DEBUG org.apache.hadoop.hbase.regionserver.wal.HLogSplitter: Creating writer path=3Dhdfs://sv4r11s38:7000/hbase/TestTable/62ff2cb514838519e5fa4282a8af4c= 35/recovered.edits/0000000000000008111 region=3D62ff2cb514838519e5fa4282a8af4c35 .... 2011-10-04 21:17:06,883 ERROR org.apache.hadoop.hbase.regionserver.wal.HLogSplitter: Couldn't close log at hdfs://sv4r11s38:7000/hbase/TestTable/62ff2cb514838519e5fa4282a8af4c= 35/recovered.edits/0000000000000008111 org.apache.hadoop.ipc.RemoteException: org.apache.hadoop.hdfs.server.namenode.LeaseExpiredException: No lease on /hbase/TestTable/62ff2cb514838519e5fa4282a8af4c35/recovered.edits/000000= 0000000008111 File does not exist. [Lease. Holder: DFSClient_hb_m_sv4r11s38:7001_1317760883384, pendingcreates: 3] at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.checkLease(F= SNamesystem.java:1604) at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.checkLease(F= SNamesystem.java:1595) at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.completeFile= Internal(FSNamesystem.java:1650) at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.completeFile= (FSNamesystem.java:1638) at org.apache.hadoop.hdfs.server.namenode.NameNode.complete(NameNod= e.java:682) at sun.reflect.GeneratedMethodAccessor15.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethod= AccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:563) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1388) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1384) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:396) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupIn= formation.java:1059) at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1382) at org.apache.hadoop.ipc.Client.call(Client.java:1066) at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225) at $Proxy6.complete(Unknown Source) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessor= Impl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethod= AccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(R= etryInvocationHandler.java:82) at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryIn= vocationHandler.java:59) at $Proxy6.complete(Unknown Source) at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.closeInternal(D= FSClient.java:3711) at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.close(DFSClient= .java:3626) at org.apache.hadoop.fs.FSDataOutputStream$PositionCache.close(FSDa= taOutputStream.java:61) at org.apache.hadoop.fs.FSDataOutputStream.close(FSDataOutputStream= .java:86) at org.apache.hadoop.io.SequenceFile$Writer.close(SequenceFile.java= :966) at org.apache.hadoop.hbase.regionserver.wal.SequenceFileLogWriter.c= lose(SequenceFileLogWriter.java:138) at org.apache.hadoop.hbase.regionserver.wal.HLogSplitter$OutputSink= .closeStreams(HLogSplitter.java:768) at org.apache.hadoop.hbase.regionserver.wal.HLogSplitter$OutputSink= .finishWritingAndClose(HLogSplitter.java:753) at org.apache.hadoop.hbase.regionserver.wal.HLogSplitter.splitLog(H= LogSplitter.java:300) at org.apache.hadoop.hbase.regionserver.wal.HLogSplitter.splitLog(H= LogSplitter.java:188) at org.apache.hadoop.hbase.master.MasterFileSystem.splitLog(MasterF= ileSystem.java:201) at org.apache.hadoop.hbase.master.handler.ServerShutdownHandler.pro= cess(ServerShutdownHandler.java:153) at org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.j= ava:156) at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoo= lExecutor.java:886) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExe= cutor.java:908) at java.lang.Thread.run(Thread.java:662) I'll keep banging at it. St.Ack