hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ted Yu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-18541) [C++] Segfaults from JNI
Date Mon, 14 Aug 2017 16:09:00 GMT

    [ https://issues.apache.org/jira/browse/HBASE-18541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16125890#comment-16125890
] 

Ted Yu commented on HBASE-18541:
--------------------------------

Another instance of segfault:
{code}
Program terminated with signal SIGSEGV, Segmentation fault.
#0  0x00007fb387315dc8 in os::write_memory_serialize_page (thread=0x2af3000) at /build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/share/vm/runtime/os.hpp:419
419	/build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/share/vm/runtime/os.hpp: No
such file or directory.
[Current thread is 1 (Thread 0x7fb387dbe840 (LWP 9221))]
Installing openjdk unwinder
(gdb) bt
#0  0x00007fb387315dc8 in ThreadStateTransition::transition_and_fence(JavaThread*, JavaThreadState,
JavaThreadState) (thread=0x2af3000)
    at /build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/share/vm/runtime/os.hpp:419
#1  0x00007fb387315dc8 in ThreadStateTransition::transition_and_fence(JavaThread*, JavaThreadState,
JavaThreadState) (thread=0x2af3000)
    at /build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/os/linux/vm/interfaceSupport_linux.hpp:31
#2  0x00007fb387315dc8 in ThreadStateTransition::transition_and_fence(JavaThread*, JavaThreadState,
JavaThreadState) (thread=thread@entry=0x2af3000, to=_thread_in_native, from=_thread_in_vm)
at /build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/share/vm/runtime/interfaceSupport.hpp:179
#3  0x00007fb38731719f in JVM_FillInStackTrace(JNIEnv*, jobject) (to=_thread_in_native, from=_thread_in_vm,
this=<synthetic pointer>)
    at /build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/share/vm/runtime/interfaceSupport.hpp:232
#4  0x00007fb38731719f in JVM_FillInStackTrace(JNIEnv*, jobject) (this=<synthetic pointer>,
__in_chrg=<optimized out>)
    at /build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/share/vm/runtime/interfaceSupport.hpp:281
#5  0x00007fb38731719f in JVM_FillInStackTrace(JNIEnv*, jobject) (env=<optimized out>,
receiver=receiver@entry=0x7ffde93448a0)
    at /build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/hotspot/src/share/vm/prims/jvm.cpp:516
#6  0x00007fb38395e851 in Java_java_lang_Throwable_fillInStackTrace (env=<optimized out>,
throwable=0x7ffde93448a0, dummy=<optimized out>)
    at /build/openjdk-8-pZyJp3/openjdk-8-8u131-b11/src/jdk/src/share/native/java/lang/Throwable.c:49
#7  0x00007fb373eb9a28 in [native offset=0xa8] java.lang.Throwable.fillInStackTrace(int) ()
at java/lang/Throwable.java
#8  0x00007fb3743472a4 in [compiled offset=0x84] java.lang.Throwable.fillInStackTrace() ()
at java/lang/Throwable.java:781
#9  0x00007fb3743bc914 in [compiled offset=0x194] java.lang.Throwable.<init>() () at
java/lang/Throwable.java:249
#10 0x00007fb37421a0d4 in [compiled offset=0x1b4] org.apache.log4j.helpers.PatternParser$LocationPatternConverter.convert(org.apache.log4j.spi.LoggingEvent)
()
    at org/apache/log4j/helpers/PatternParser.java:500
#11 0x00007fb37417eab4 in [compiled offset=0x114] org.apache.log4j.helpers.PatternConverter.format(java.lang.StringBuffer,org.apache.log4j.spi.LoggingEvent)
()
    at org/apache/log4j/helpers/PatternConverter.java:65
#12 0x00007fb37426315c in [inlined] java.lang.StringBuffer.setLength(int) () at java/lang/StringBuffer.java:193
0x00007fb37426315c in [compiled offset=0x71c] org.apache.log4j.PatternLayout.format(org.apache.log4j.spi.LoggingEvent)
() at org/apache/log4j/PatternLayout.java:503
#13 0x00007fb37454484c in [compiled offset=0x12c] org.apache.log4j.WriterAppender.subAppend(org.apache.log4j.spi.LoggingEvent)
() at org/apache/log4j/WriterAppender.java:310
#14 0x00007fb374538aac in [compiled offset=0x1ec] org.apache.log4j.WriterAppender.append(org.apache.log4j.spi.LoggingEvent)
() at org/apache/log4j/WriterAppender.java:160
#15 0x00007fb37454793c in [compiled offset=0x113c] org.apache.log4j.AppenderSkeleton.doAppend(org.apache.log4j.spi.LoggingEvent)
()
    at org/apache/log4j/AppenderSkeleton.java:251
#16 0x00007fb374074204 in [compiled offset=0x4c4] org.apache.log4j.helpers.AppenderAttachableImpl.appendLoopOnAppenders(org.apache.log4j.spi.LoggingEvent)
()
    at org/apache/log4j/helpers/AppenderAttachableImpl.java:66
#17 0x00007fb3742b5f24 in [compiled offset=0x1e4] org.apache.log4j.Category.callAppenders(org.apache.log4j.spi.LoggingEvent)
() at org/apache/log4j/Category.java:200
#18 0x00007fb374208d5c in [inlined] org.apache.log4j.Category.forcedLog(java.lang.String,org.apache.log4j.Priority,java.lang.Object,java.lang.Throwable)
()
    at org/apache/log4j/Category.java:392
0x00007fb374208d5c in [compiled offset=0x67c] org.apache.log4j.Category.log(java.lang.String,org.apache.log4j.Priority,java.lang.Object,java.lang.Throwable)
()
    at org/apache/log4j/Category.java:858
#19 0x00007fb37454b374 in [compiled offset=0x154] org.apache.commons.logging.impl.Log4JLogger.info(java.lang.Object)
() at org/apache/commons/logging/impl/Log4JLogger.java:177
#20 0x00007fb373cee042 in [interpreted: bc = 50] org.apache.hadoop.hbase.regionserver.HRegionServer.stop(java.lang.String)
()
    at org/apache/hadoop/hbase/regionserver/HRegionServer.java:1925
{code}
No zookeeper involved. But the line number in os.hpp was the same.

> [C++] Segfaults from JNI
> ------------------------
>
>                 Key: HBASE-18541
>                 URL: https://issues.apache.org/jira/browse/HBASE-18541
>             Project: HBase
>          Issue Type: Sub-task
>            Reporter: Enis Soztutar
>            Assignee: Ted Yu
>
> retry-test and multi-retry-test fails flakily when run with 
> {code}
> buck test --all --no-results-cache
> {code}
> or when run in a loop:
> {code}
> for i in `seq 1 10`; do buck test --no-results-cache core:retry-test || break 1; done
> {code}
> The problem seems to be within the JNI internals and usually happens at the create table
method call. I was not able to inspect much, but the comments in our mini-cluster indicate
that we may need to use global references instead of local ones. I suspect the problem happens
when there is a GC run for the test since the failure happens usually after some time (but
almost always in create table method). 
> [~ted_yu] do you mind taking a look at this. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message