hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ted Yu (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-4690) Intermittent TestRegionServerCoprocessorExceptionWithAbort#testExceptionFromCoprocessorDuringPut failure
Date Sun, 30 Oct 2011 05:05:32 GMT

    [ https://issues.apache.org/jira/browse/HBASE-4690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13139543#comment-13139543
] 

Ted Yu commented on HBASE-4690:
-------------------------------

It is pretty clear what happened in build 2384. The failure was because regions brought online
wasn't in the same order as start keys are defined:
{code}
2011-10-29 21:45:43,789 INFO  [RS_OPEN_REGION-hemera.apache.org,37045,1319924731557-0] regionserver.HRegion(502):
Onlined observed_table,kkk,1319924743536.d2bb03652b0e69a4a192be3b60f6cd78.; next sequenceid=1
...
2011-10-29 21:45:43,883 DEBUG [Thread-183] hbase.HBaseTestingUtility(1129): Found 25 rows
for table observed_table
2011-10-29 21:45:43,883 DEBUG [Thread-183] hbase.HBaseTestingUtility(1132): FirstRow=observed_table,,1319924743504.ed6d9b9f5122809fad16e61835367b48.
2011-10-29 21:45:43,887 INFO  [RS_OPEN_REGION-hemera.apache.org,37045,1319924731557-1] regionserver.HRegion(502):
Onlined observed_table,lll,1319924743540.ac163536355dbe1ab71ab1a9ee7a22d4.; next sequenceid=1
...
2011-10-29 21:45:43,950 DEBUG [main-EventThread] zookeeper.ZKUtil(228): master:34047-0x13351a50a270000
Set watcher on existing znode /hbase/unassigned/ed6d9b9f5122809fad16e61835367b48
...
2011-10-29 21:45:44,050 INFO  [RS_OPEN_REGION-hemera.apache.org,45759,1319924731527-0] regionserver.HRegion(502):
Onlined observed_table,,1319924743504.ed6d9b9f5122809fad16e61835367b48.; next sequenceid=1
{code}
We can see the ~170ms delay between the discovery of region 1319924743504.ed6d9b9f5122809fad16e61835367b48.
and its actual online.

A simple patch would be to give getRSForFirstRegionInTable() some time if index returned by
hbaseCluster.getServerWith() was -1.
                
> Intermittent TestRegionServerCoprocessorExceptionWithAbort#testExceptionFromCoprocessorDuringPut
failure
> --------------------------------------------------------------------------------------------------------
>
>                 Key: HBASE-4690
>                 URL: https://issues.apache.org/jira/browse/HBASE-4690
>             Project: HBase
>          Issue Type: Test
>    Affects Versions: 0.92.0
>            Reporter: Ted Yu
>            Assignee: Eugene Koontz
>             Fix For: 0.92.0
>
>
> See https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/83/testReport/junit/org.apache.hadoop.hbase.coprocessor/TestRegionServerCoprocessorExceptionWithAbort/testExceptionFromCoprocessorDuringPut/
> Somehow getRSForFirstRegionInTable() wasn't able to retrieve the region server.
> One fix for this issue is to spin up MiniCluster with 1 region server so that we don't
need to search for the region server where first region is hosted.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message