hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Yu <yuzhih...@gmail.com>
Subject Re: Region Server Start Failure
Date Mon, 18 May 2015 01:47:32 GMT
Can you give us a bit more information - I assume hbase config is the same
on this server compared with the other region servers:

region server log preceding 'Failed to close file' exception - I looked for
'Caused by' but didn't find any.

is this a secure cluster ?

w.r.t. /apps/hbase/data/WALs/ip-10-0-1-74.ec2.internal,60020,
1431910460262/ip-10-0-1-74.ec2.internal%2C60020%
2C1431910460262.1431910462148
Can you look for the above file in hdfs audit log to see when it was
deleted / renamed ?

Thanks

On Sun, May 17, 2015 at 6:33 PM, Billy Watson <williamrwatson@gmail.com>
wrote:

> I added a region server process to an existing cluster node that didn't
> previously have a region server process and I'm getting a weird error (see
> stack traces below).
>
> I'm using HDP 2.1, HBase 0.98.0.2.1.7.0-784-hadoop2,
> hadoop 2.4.0.2.1.7.0-784. All the other 6 region servers are up without
> issue.
>
> I've tried starting the server multiple times, an HDFS balance (that's
> still going after 24+ hours) an HBase master restart.
>
> Thanks for any help!
>
> 2015-05-18 00:54:22,586 ERROR [Thread-20] hdfs.DFSClient: Failed to close
> file
>
> /apps/hbase/data/WALs/ip-10-0-1-74.ec2.internal,60020,1431910460262/ip-10-0-1-74.ec2.internal%2C60020%2C1431910460262.1431910462148
>
> org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.hdfs.server.namenode.LeaseExpiredException):
> No lease on
>
> /apps/hbase/data/WALs/ip-10-0-1-74.ec2.internal,60020,1431910460262/ip-10-0-1-74.ec2.internal%2C60020%2C1431910460262.1431910462148:
> File does not exist. Holder
>
> DFSClient_hb_rs_ip-10-0-1-74.ec2.internal,60020,1431910460262_-1181395688_33
> does not have any open files.
> at
>
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem.checkLease(FSNamesystem.java:2946)
> at
>
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem.completeFileInternal(FSNamesystem.java:3016)
> at
>
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem.completeFile(FSNamesystem.java:2996)
> at
>
> org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.complete(NameNodeRpcServer.java:647)
> at
>
> org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.complete(ClientNamenodeProtocolServerSideTranslatorPB.java:484)
> at
>
> org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java)
> at
>
> org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:585)
> at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:928)
> at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2013)
> at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2009)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:415)
> at
>
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1594)
> at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2007)
>
> at org.apache.hadoop.ipc.Client.call(Client.java:1410)
> at org.apache.hadoop.ipc.Client.call(Client.java:1363)
> at
>
> org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:206)
> at com.sun.proxy.$Proxy18.complete(Unknown Source)
> at
>
> org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolTranslatorPB.complete(ClientNamenodeProtocolTranslatorPB.java:404)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at
>
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
> at
>
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:606)
> at
>
> org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:190)
> at
>
> org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:103)
> at com.sun.proxy.$Proxy19.complete(Unknown Source)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at
>
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
> at
>
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:606)
> at org.apache.hadoop.hbase.fs.HFileSystem$1.invoke(HFileSystem.java:272)
> at com.sun.proxy.$Proxy20.complete(Unknown Source)
> at
>
> org.apache.hadoop.hdfs.DFSOutputStream.completeFile(DFSOutputStream.java:2116)
> at org.apache.hadoop.hdfs.DFSOutputStream.close(DFSOutputStream.java:2100)
> at
>
> org.apache.hadoop.hdfs.DFSClient.closeAllFilesBeingWritten(DFSClient.java:850)
> at org.apache.hadoop.hdfs.DFSClient.close(DFSClient.java:867)
> at
>
> org.apache.hadoop.hdfs.DistributedFileSystem.close(DistributedFileSystem.java:857)
> at org.apache.hadoop.fs.FileSystem$Cache.closeAll(FileSystem.java:2488)
> at
>
> org.apache.hadoop.fs.FileSystem$Cache$ClientFinalizer.run(FileSystem.java:2505)
> at java.lang.Thread.run(Thread.java:745)
> 2015-05-18 00:54:22,587 INFO  [Thread-11] regionserver.ShutdownHook:
> Shutdown hook finished.
>
>
>
>
> William Watson
> Software Engineer
> (904) 705-7056 PCS
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message