hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ted Yu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-18541) [C++] Segfaults from JNI
Date Mon, 14 Aug 2017 17:14:00 GMT

    [ https://issues.apache.org/jira/browse/HBASE-18541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16125998#comment-16125998
] 

Ted Yu commented on HBASE-18541:
--------------------------------

This instance was from netty :
{code}
Program terminated with signal SIGSEGV, Segmentation fault.
#0  0x00007f488085a69e in ?? ()
[Current thread is 1 (Thread 0x7f48947fa840 (LWP 6965))]
Installing openjdk unwinder
(gdb) bt
#0  0x00007f488085a69e in  ()
#1  0x00007f4880729d80 in [interpreted: bc = 20] io.netty.channel.nio.NioEventLoop.wakeup(boolean)
() at io/netty/channel/nio/NioEventLoop.java:645
#2  0x00007f4880729ffd in [interpreted: bc = 75] io.netty.util.concurrent.SingleThreadEventExecutor.execute(java.lang.Runnable)
()
    at io/netty/util/concurrent/SingleThreadEventExecutor.java:681
#3  0x00007f488072a042 in [interpreted: bc = 51] org.apache.hadoop.hbase.ipc.AsyncRpcChannelImpl.close(java.lang.Throwable)
()
    at org/apache/hadoop/hbase/ipc/AsyncRpcChannelImpl.java:596
#4  0x00007f488072a042 in [interpreted: bc = 77] org.apache.hadoop.hbase.ipc.AsyncRpcClient.close()
() at org/apache/hadoop/hbase/ipc/AsyncRpcClient.java:346
#5  0x00007f488072a042 in [interpreted: bc = 71] org.apache.hadoop.hbase.client.ConnectionImplementation.close()
()
    at org/apache/hadoop/hbase/client/ConnectionImplementation.java:1911
#6  0x00007f488072a042 in [interpreted: bc = 33] org.apache.hadoop.hbase.HBaseTestingUtility.shutdownMiniCluster()
() at org/apache/hadoop/hbase/HBaseTestingUtility.java:1166
#7  0x00007f48807224e7 in StubRoutines (1) ()
{code}

> [C++] Segfaults from JNI
> ------------------------
>
>                 Key: HBASE-18541
>                 URL: https://issues.apache.org/jira/browse/HBASE-18541
>             Project: HBase
>          Issue Type: Sub-task
>            Reporter: Enis Soztutar
>            Assignee: Ted Yu
>
> retry-test and multi-retry-test fails flakily when run with 
> {code}
> buck test --all --no-results-cache
> {code}
> or when run in a loop:
> {code}
> for i in `seq 1 10`; do buck test --no-results-cache core:retry-test || break 1; done
> {code}
> The problem seems to be within the JNI internals and usually happens at the create table
method call. I was not able to inspect much, but the comments in our mini-cluster indicate
that we may need to use global references instead of local ones. I suspect the problem happens
when there is a GC run for the test since the failure happens usually after some time (but
almost always in create table method). 
> [~ted_yu] do you mind taking a look at this. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message