hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ted Yu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-6330) TestImportExport has been failing against hadoop 0.23/2.0 profile [Part2]
Date Sun, 13 Jan 2013 23:30:12 GMT

    [ https://issues.apache.org/jira/browse/HBASE-6330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13552354#comment-13552354
] 

Ted Yu commented on HBASE-6330:
-------------------------------

I found division by zero error again, see https://builds.apache.org/job/HBase-TRUNK-on-Hadoop-2.0.0/345/testReport/org.apache.hadoop.hbase.mapreduce/TestImportExport/testSimpleCase/
{code}
2013-01-12 11:53:52,809 WARN  [AsyncDispatcher event handler] resourcemanager.RMAuditLogger(255):
USER=jenkins	OPERATION=Application Finished - Failed	TARGET=RMAppManager	RESULT=FAILURE	DESCRIPTION=App
failed with state: FAILED	PERMISSIONS=Application application_1357991604658_0002 failed 1
times due to AM Container for appattempt_1357991604658_0002_000001 exited with  exitCode:
-1000 due to: java.lang.ArithmeticException: / by zero
	at org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:368)
	at org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:150)
	at org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:131)
	at org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:115)
	at org.apache.hadoop.yarn.server.nodemanager.LocalDirsHandlerService.getLocalPathForWrite(LocalDirsHandlerService.java:279)
	at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:851)

.Failing this attempt.. Failing the application.	APPID=application_1357991604658_0002
{code}
Here is related code:
{code}
        // Keep rolling the wheel till we get a valid path
        Random r = new java.util.Random();
        while (numDirsSearched < numDirs && returnPath == null) {

          long randomPosition = Math.abs(r.nextLong()) % totalAvailable;
{code}
My guess is that totalAvailable was 0, meaning dirDF was empty.

Locally, I saw the following:
{code}
 <testcase time="12.008" classname="org.apache.hadoop.hbase.mapreduce.TestImportExport"
name="testExportScannerBatching">
    <error type="java.lang.reflect.UndeclaredThrowableException">java.lang.reflect.UndeclaredThrowableException
  at org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl.unwrapAndThrowException(YarnRemoteExceptionPBImpl.java:135)
  at org.apache.hadoop.yarn.api.impl.pb.client.ClientRMProtocolPBClientImpl.getNewApplication(ClientRMProtocolPBClientImpl.java:154)
  at org.apache.hadoop.yarn.client.YarnClientImpl.getNewApplication(YarnClientImpl.java:111)
  at org.apache.hadoop.mapred.ResourceMgrDelegate.getNewJobID(ResourceMgrDelegate.java:108)
  at org.apache.hadoop.mapred.YARNRunner.getNewJobID(YARNRunner.java:214)
  at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:345)
  at org.apache.hadoop.mapreduce.Job$11.run(Job.java:1218)
  at org.apache.hadoop.mapreduce.Job$11.run(Job.java:1215)
...
Caused by: com.google.protobuf.ServiceException: java.net.ConnectException: Call From TYus-MacBook-Pro.local/192.168.0.23
to 0.0.0.0:8032 failed on connection exception: java.net.ConnectException: Connection refused;
For more details see:  http://wiki.apache.org/hadoop/ConnectionRefused
  at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:212)
  at $Proxy92.getNewApplication(Unknown Source)
  at org.apache.hadoop.yarn.api.impl.pb.client.ClientRMProtocolPBClientImpl.getNewApplication(ClientRMProtocolPBClientImpl.java:151)
  ... 45 more
Caused by: java.net.ConnectException: Call From TYus-MacBook-Pro.local/192.168.0.23 to 0.0.0.0:8032
failed on connection exception: java.net.ConnectException: Connection refused; For more details
see:  http://wiki.apache.org/hadoop/ConnectionRefused
  at org.apache.hadoop.net.NetUtils.wrapException(NetUtils.java:722)
  at org.apache.hadoop.ipc.Client.call(Client.java:1168)
  at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:202)
  ... 47 more
Caused by: java.net.ConnectException: Connection refused
  at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
  at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:692)
  at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206)
  at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:524)
  at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:489)
  at org.apache.hadoop.ipc.Client$Connection.setupConnection(Client.java:478)
  at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:573)
  at org.apache.hadoop.ipc.Client$Connection.access$1700(Client.java:220)
  at org.apache.hadoop.ipc.Client.getConnection(Client.java:1217)
  at org.apache.hadoop.ipc.Client.call(Client.java:1144)
{code}
                
> TestImportExport has been failing against hadoop 0.23/2.0 profile [Part2]
> -------------------------------------------------------------------------
>
>                 Key: HBASE-6330
>                 URL: https://issues.apache.org/jira/browse/HBASE-6330
>             Project: HBase
>          Issue Type: Sub-task
>          Components: test
>    Affects Versions: 0.94.1, 0.96.0
>            Reporter: Jonathan Hsieh
>              Labels: hadoop-2.0
>             Fix For: 0.96.0
>
>         Attachments: hbase-6330-94.patch, hbase-6330-trunk.patch, hbase-6330-v2.patch
>
>
> See HBASE-5876.  I'm going to commit the v3 patches under this name since there has been
two months (my bad) since the first half was committed and found to be incomplte.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message