hbase-issues mailing list archives

From "ramkrishna.s.vasudevan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-7468) TestSplitTransactionOnCluster hangs frequently
Date Mon, 14 Jan 2013 10:00:15 GMT

    [ https://issues.apache.org/jira/browse/HBASE-7468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13552542#comment-13552542 ]

ramkrishna.s.vasudevan commented on HBASE-7468:
-----------------------------------------------

Ideally, the nodeChildrenChange event handler iterates through the list of new znodes under the
/hbase/assignment node and then sets a watcher on each of them.  One possible reason for the hang
is that there is more than one znode, so setting the watches takes time.  The order of execution
also differs between running from Eclipse and running the mvn tests.
My conclusion is that the test case does reveal a real problem, and the same thing can happen in
any scenario where the SPLITTING event can be missed.  Maybe some of the SPLIT-related bugs
that were raised are due to this?  Need to analyse.

Any suggestions are welcome.  We should ensure that the transition from SPLITTING to SPLIT
happens only after the master has set the watch on the znode, and we should be sure of
that.
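
To make the race described above concrete, here is a minimal sketch in plain Java. It is a toy
in-memory model, not HBase or ZooKeeper code: ToyZnode, getDataAndWatch, setState and the
CountDownLatch handshake are all hypothetical names and mechanisms made up for illustration. It
only assumes that watches are one-shot and that a watch set after a change does not see that
change. The first method shows the master arriving late and never observing SPLITTING; the second
shows one possible way to order things so the transition waits for the watch.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.CountDownLatch;
import java.util.function.Consumer;

/** Toy model of the missed-SPLITTING race. Hypothetical code, not the HBase/ZooKeeper API. */
class SplitWatchSketch {

    /** Hypothetical stand-in for a region's znode with a one-shot data watch. */
    static final class ToyZnode {
        private String state;
        private Consumer<String> watch; // cleared after it fires once

        ToyZnode(String initialState) {
            this.state = initialState;
        }

        /** Registers a one-shot watch and returns the state as of registration. */
        synchronized String getDataAndWatch(Consumer<String> watcher) {
            this.watch = watcher;
            return state;
        }

        /** Changes the state and fires the registered watch, if any. */
        void setState(String newState) {
            Consumer<String> toFire;
            synchronized (this) {
                state = newState;
                toFire = watch;
                watch = null;
            }
            if (toFire != null) {
                toFire.accept(newState);
            }
        }
    }

    /** The region server transitions before the master's watch exists. */
    static List<String> masterSeesWithoutHandshake() {
        ToyZnode node = new ToyZnode("SPLITTING"); // region server creates the znode
        node.setState("SPLIT");                    // and moves on immediately
        List<String> seen = new ArrayList<>();
        seen.add(node.getDataAndWatch(seen::add)); // master sets its watch too late
        return seen;
    }

    /** The region server waits until the master has set its watch (assumed handshake). */
    static List<String> masterSeesWithHandshake() {
        ToyZnode node = new ToyZnode("SPLITTING");
        List<String> seen = Collections.synchronizedList(new ArrayList<>());
        CountDownLatch watchSet = new CountDownLatch(1);
        Thread regionServer = new Thread(() -> {
            try {
                watchSet.await();       // do not transition until the watch is in place
                node.setState("SPLIT");
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        regionServer.start();
        seen.add(node.getDataAndWatch(seen::add)); // master reads SPLITTING and sets the watch
        watchSet.countDown();
        try {
            regionServer.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return seen;
    }

    public static void main(String[] args) {
        System.out.println("without handshake: " + masterSeesWithoutHandshake());
        System.out.println("with handshake:    " + masterSeesWithHandshake());
    }
}
```

In this model the late master only ever reads SPLIT, which is the "SPLITTING event missed" case.
How the real region server could learn that the master's watch is set is exactly the open
question; the latch here just stands in for whatever signal is chosen.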
                
> TestSplitTransactionOnCluster hangs frequently
> ----------------------------------------------
>
>                 Key: HBASE-7468
>                 URL: https://issues.apache.org/jira/browse/HBASE-7468
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.94.3
>            Reporter: Lars Hofhansl
>            Assignee: ramkrishna.s.vasudevan
>             Fix For: 0.94.5
>
>         Attachments: 7468-0.94.txt, 7468-0.94-v2.txt, 7468-0.94-v4.txt, 7468-jstack.txt,
7468-output.zip, HBASE-7468v3.patch, TEST-org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster.xml,
TEST-org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster.xml, TEST-org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster.xml,
TestSplitTransactionOnCluster-jstack.txt
>
>
> This is what I saw once in a local build.
> {code}
> java.lang.Thread.State: TIMED_WAITING (sleeping)
>         at java.lang.Thread.sleep(Native Method)
>         at org.apache.hadoop.hbase.client.HBaseAdmin.disableTable(HBaseAdmin.java:831)
>         at org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster.testShouldClearRITWhenNodeFoundInSplittingState(TestSplitTransactionOnCluster.java:650)
> {code}

