Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id BF0EC20049B for ; Mon, 14 Aug 2017 19:14:06 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id BD91B163A63; Mon, 14 Aug 2017 17:14:06 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 1787516378B for ; Mon, 14 Aug 2017 19:14:05 +0200 (CEST) Received: (qmail 79038 invoked by uid 500); 14 Aug 2017 17:14:02 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 79027 invoked by uid 99); 14 Aug 2017 17:14:02 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 14 Aug 2017 17:14:02 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 38EC01A0B63 for ; Mon, 14 Aug 2017 17:14:02 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -100.002 X-Spam-Level: X-Spam-Status: No, score=-100.002 tagged_above=-999 required=6.31 tests=[RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id qmVTYcMdLs40 for ; Mon, 14 Aug 2017 17:14:01 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id 0A6375F238 for ; Mon, 14 Aug 2017 17:14:01 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 8B779E08F5 for ; Mon, 14 Aug 2017 17:14:00 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 3ECED2140C for ; Mon, 14 Aug 2017 17:14:00 +0000 (UTC) Date: Mon, 14 Aug 2017 17:14:00 +0000 (UTC) From: "Ted Yu (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-18541) [C++] Segfaults from JNI MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Mon, 14 Aug 2017 17:14:06 -0000 [ https://issues.apache.org/jira/browse/HBASE-18541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16125998#comment-16125998 ] Ted Yu commented on HBASE-18541: -------------------------------- This instance was from netty : {code} Program terminated with signal SIGSEGV, Segmentation fault. #0 0x00007f488085a69e in ?? () [Current thread is 1 (Thread 0x7f48947fa840 (LWP 6965))] Installing openjdk unwinder (gdb) bt #0 0x00007f488085a69e in () #1 0x00007f4880729d80 in [interpreted: bc = 20] io.netty.channel.nio.NioEventLoop.wakeup(boolean) () at io/netty/channel/nio/NioEventLoop.java:645 #2 0x00007f4880729ffd in [interpreted: bc = 75] io.netty.util.concurrent.SingleThreadEventExecutor.execute(java.lang.Runnable) () at io/netty/util/concurrent/SingleThreadEventExecutor.java:681 #3 0x00007f488072a042 in [interpreted: bc = 51] org.apache.hadoop.hbase.ipc.AsyncRpcChannelImpl.close(java.lang.Throwable) () at org/apache/hadoop/hbase/ipc/AsyncRpcChannelImpl.java:596 #4 0x00007f488072a042 in [interpreted: bc = 77] org.apache.hadoop.hbase.ipc.AsyncRpcClient.close() () at org/apache/hadoop/hbase/ipc/AsyncRpcClient.java:346 #5 0x00007f488072a042 in [interpreted: bc = 71] org.apache.hadoop.hbase.client.ConnectionImplementation.close() () at org/apache/hadoop/hbase/client/ConnectionImplementation.java:1911 #6 0x00007f488072a042 in [interpreted: bc = 33] org.apache.hadoop.hbase.HBaseTestingUtility.shutdownMiniCluster() () at org/apache/hadoop/hbase/HBaseTestingUtility.java:1166 #7 0x00007f48807224e7 in StubRoutines (1) () {code} > [C++] Segfaults from JNI > ------------------------ > > Key: HBASE-18541 > URL: https://issues.apache.org/jira/browse/HBASE-18541 > Project: HBase > Issue Type: Sub-task > Reporter: Enis Soztutar > Assignee: Ted Yu > > retry-test and multi-retry-test fails flakily when run with > {code} > buck test --all --no-results-cache > {code} > or when run in a loop: > {code} > for i in `seq 1 10`; do buck test --no-results-cache core:retry-test || break 1; done > {code} > The problem seems to be within the JNI internals and usually happens at the create table method call. I was not able to inspect much, but the comments in our mini-cluster indicate that we may need to use global references instead of local ones. I suspect the problem happens when there is a GC run for the test since the failure happens usually after some time (but almost always in create table method). > [~ted_yu] do you mind taking a look at this. -- This message was sent by Atlassian JIRA (v6.4.14#64029)