hbase-issues mailing list archives

From "Ted Yu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-18541) [C++] Segfaults from JNI
Date Thu, 10 Aug 2017 17:04:00 GMT

    [ https://issues.apache.org/jira/browse/HBASE-18541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16121916#comment-16121916 ]

Ted Yu commented on HBASE-18541:
--------------------------------

Sometimes the retry-test ended with:
{code}
[ RUN      ] AsyncRpcRetryTest.TestFailWithOperationTimeout
2017-08-10 17:01:54,177 INFO  [RpcServer.FifoWFPBQ.default.handler=1,queue=1,port=41153] master.HMaster
(HMaster.java:createTable(1530)) - Client=root//172.17.0.2 create 'table6', {NAME => 'd',
BLOOMFILTER => 'NONE', VERSIONS => '1', IN_MEMORY => 'false', KEEP_DELETED_CELLS
=> 'FALSE', IN_MEMORY_COMPACTION => 'false', DATA_BLOCK_ENCODING => 'NONE', TTL =>
'FOREVER', COMPRESSION => 'NONE', MIN_VERSIONS => '0', BLOCKCACHE => 'true', BLOCKSIZE
=> '65536', REPLICATION_SCOPE => '0'}
2017-08-10 17:01:54,285 INFO  [ProcessThread(sid:0 cport:55375):] server.PrepRequestProcessor
(PrepRequestProcessor.java:pRequest(651)) - Got user-level KeeperException when processing
sessionid:0x15dcd181a4d0000 type:create cxid:0xb5 zxid:0x5a txntype:-1 reqpath:n/a Error Path:/hbase/table-lock/table6
Error:KeeperErrorCode = NoNode for /hbase/table-lock/table6
2017-08-10 17:01:54,495 INFO  [RegionOpenAndInitThread-table6-1] regionserver.HRegion (HRegion.java:createHRegion(6282))
- creating HRegion table6 HTD == 'table6', {NAME => 'd', BLOOMFILTER => 'NONE', VERSIONS
=> '1', IN_MEMORY => 'false', KEEP_DELETED_CELLS => 'FALSE', IN_MEMORY_COMPACTION
=> 'false', DATA_BLOCK_ENCODING => 'NONE', TTL => 'FOREVER', COMPRESSION => 'NONE',
MIN_VERSIONS => '0', BLOCKCACHE => 'true', BLOCKSIZE => '65536', REPLICATION_SCOPE
=> '0'} RootDir = file:/usr/src/hbase/hbase-native-client/target/test-data/2db36bc6-e3b0-433a-b812-fc48cd42fd23/.tmp
Table name == table6
2017-08-10 17:01:54,542 INFO  [RegionOpenAndInitThread-table6-1] regionserver.HRegion (HRegion.java:doClose(1590))
- Closed table6,,1502384514176.f9265e2eb45f3087f533c45ab1eaaaa5.
2017-08-10 17:01:54,653 INFO  [ProcedureExecutor-0] hbase.MetaTableAccessor (MetaTableAccessor.java:addRegionsToMeta(1571))
- Added 1
{code}
However, AsyncRpcRetryTest.TestFailWithOperationTimeout passes when run individually.

> [C++] Segfaults from JNI
> ------------------------
>
>                 Key: HBASE-18541
>                 URL: https://issues.apache.org/jira/browse/HBASE-18541
>             Project: HBase
>          Issue Type: Sub-task
>            Reporter: Enis Soztutar
>            Assignee: Ted Yu
>
> retry-test and multi-retry-test fail intermittently when run with
> {code}
> buck test --all --no-results-cache
> {code}
> or when run in a loop:
> {code}
> for i in `seq 1 10`; do buck test --no-results-cache core:retry-test || break 1; done
> {code}
> The problem seems to lie in the JNI internals and usually occurs at the create-table
> method call. I was not able to inspect much, but the comments in our mini-cluster indicate
> that we may need to use global references instead of local ones. I suspect the failure is
> triggered by a GC run during the test, since it usually happens after some time (but
> almost always in the create-table method).
> [~ted_yu] do you mind taking a look at this?
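
The point about global versus local references in the description above can be sketched as follows. A JNI local reference is valid only in the thread that created it and only until its native frame is released, so an object that native code keeps across calls generally needs to be promoted with NewGlobalRef and later released with DeleteGlobalRef. This is a minimal illustration, not the mini-cluster code in hbase-native-client: the class name GlobalRefHolder is hypothetical, and Env and Ref are template parameters only so that the sketch compiles without a JDK. With <jni.h> they would be JNIEnv and jobject, whose NewGlobalRef, DeleteGlobalRef and DeleteLocalRef members are the calls used here.

```cpp
// Sketch only (hypothetical helper, not from the HBase source tree).
// RAII holder that promotes a JNI local reference to a global reference so
// the object stays reachable after the local frame is gone.
// Assumption: Env behaves like JNIEnv and Ref like jobject.
template <typename Env, typename Ref>
class GlobalRefHolder {
 public:
  // Takes ownership of `local`: creates a global reference to the same
  // object, then drops the local reference, which is no longer needed.
  GlobalRefHolder(Env* env, Ref local)
      : env_(env), ref_(env->NewGlobalRef(local)) {
    env_->DeleteLocalRef(local);
  }

  // Releases the global reference so the JVM can collect the object.
  // Caveat: a real JNIEnv* is per-thread, so this sketch assumes the holder
  // is destroyed on the same thread that created it.
  ~GlobalRefHolder() {
    if (ref_) env_->DeleteGlobalRef(ref_);
  }

  // Non-copyable: two holders must not delete the same global reference.
  GlobalRefHolder(const GlobalRefHolder&) = delete;
  GlobalRefHolder& operator=(const GlobalRefHolder&) = delete;

  // The global reference; usable across native calls until destruction.
  Ref get() const { return ref_; }

 private:
  Env* env_;
  Ref ref_;
};
```

Whether a missing global reference is actually the cause of the segfaults reported here is not established by this thread; the sketch only shows the pattern the description alludes to.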



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
