hbase-issues mailing list archives

From "Enis Soztutar (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-5986) Clients can see holes in the META table when regions are being split
Date Sat, 26 May 2012 02:09:24 GMT

    [ https://issues.apache.org/jira/browse/HBASE-5986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13283881#comment-13283881 ]

Enis Soztutar commented on HBASE-5986:
--------------------------------------

Here are the test results for 0.94: 
{code}
Tests run: 551, Failures: 0, Errors: 0, Skipped: 0
...
Tests run: 932, Failures: 1, Errors: 2, Skipped: 9

Failed tests:   testShutdownSimpleFixup(org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster):
expected:<1> but was:<0>

Tests in error: 
  testDelayedRpcImmediateReturnValue(org.apache.hadoop.hbase.ipc.TestDelayedRpc): Call to
/127.0.0.1:53586 failed on socket timeout exception: java.net.SocketTimeoutException: 1000
millis timeout while waiting for channel to be ready for read. ch : java.nio.channels.SocketChannel[connected
local=/127.0.0.1:53623 remote=/127.0.0.1:53586]
  testLocalHBaseCluster(org.apache.hadoop.hbase.TestLocalHBaseCluster): Master not initialized
after 200 seconds
{code}

I reran the failed tests locally and they passed, except TestLocalHBaseCluster, which also
fails on 0.94 HEAD for me.

For 0.92:
{code}

Results :

Failed tests:   testMultipleResubmits(org.apache.hadoop.hbase.master.TestSplitLogManager)
  testcomputeHDFSBlocksDistribution(org.apache.hadoop.hbase.util.TestFSUtils)

Tests in error:
  testClusterRestart(org.apache.hadoop.hbase.master.TestRestartCluster): org.apache.hadoop.hbase.PleaseHoldException:
Master is initializing
  testWholesomeSplit(org.apache.hadoop.hbase.regionserver.TestSplitTransaction): Failed delete
of /homes/hortonde/enis/code/hbase-0.92/target/test-data/af023188-0b23-4f9d-a9bc-a074e94e57f8/org.apache.hadoop.hbase.regionserver.TestSplitTransaction/table/7c59b6677ad46bf3f652a83de1e62bcb
  testRollback(org.apache.hadoop.hbase.regionserver.TestSplitTransaction): Target HLog directory
already exists: /homes/hortonde/enis/code/hbase-0.92/target/test-data/af023188-0b23-4f9d-a9bc-a074e94e57f8/org.apache.hadoop.hbase.regionserver.TestSplitTransaction/logs
  testRollback(org.apache.hadoop.hbase.regionserver.TestSplitTransaction)
  loadTest[0](org.apache.hadoop.hbase.util.TestMiniClusterLoadSequential): test timed out
after 120000 milliseconds
  loadTest[0](org.apache.hadoop.hbase.util.TestMiniClusterLoadParallel): test timed out after
120000 milliseconds

Tests run: 1135, Failures: 2, Errors: 6, Skipped: 8
{code} 

I also reran those failed tests locally, and they passed. It seems we can go ahead with
0.92 and 0.94 if you don't have any concerns.
                
> Clients can see holes in the META table when regions are being split
> --------------------------------------------------------------------
>
>                 Key: HBASE-5986
>                 URL: https://issues.apache.org/jira/browse/HBASE-5986
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.92.1, 0.96.0, 0.94.1
>            Reporter: Enis Soztutar
>            Assignee: Enis Soztutar
>             Fix For: 0.96.0
>
>         Attachments: 5986-v2.txt, HBASE-5986-0.92.patch, HBASE-5986-0.94.patch, HBASE-5986-test_v1.patch,
HBASE-5986_v3.patch
>
>
> We found this issue when running large-scale ingestion tests for HBASE-5754. The problem
is that the .META. table updates are not atomic while splitting a region. In SplitTransaction,
there is a time gap between marking the parent offline and adding the daughters to the
META table. As a result, clients using MetaScanner or HTable.getStartEndKeys (used
by the TableInputFormat) can miss regions whose parent has just been taken offline but whose
daughters have not been added yet.
> This is also related to HBASE-4335. 
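
To make the "hole" in the description above concrete, here is a minimal sketch of how a client-side check could detect one in a list of region key ranges. It does not use the HBase API: the MetaHoleCheck class, its Range type and the findHoles method are hypothetical names introduced only for illustration, and keys are plain Strings rather than the byte[] keys HBase really uses. It assumes the usual convention that the first region starts at the empty key and the last region ends at the empty key.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical illustration only; not part of HBase or of the attached patches.
public class MetaHoleCheck {

    /** Simplified stand-in for a region's [start, end) key range (real keys are byte[]). */
    public static final class Range {
        final String start;
        final String end;

        public Range(String start, String end) {
            this.start = start;
            this.end = end;
        }
    }

    /**
     * Returns a description of every gap in the key space.
     * An empty result means the ranges cover the table from the first key to the last.
     */
    public static List<String> findHoles(List<Range> regions) {
        List<Range> sorted = new ArrayList<>(regions);
        sorted.sort(Comparator.comparing((Range r) -> r.start));

        List<String> holes = new ArrayList<>();
        String expectedStart = "";   // the first region starts at the empty key
        boolean reachedEnd = false;  // set once a region ends at the empty key

        for (Range r : sorted) {
            if (reachedEnd) {
                break;
            }
            // A region that starts after the previous end leaves a gap.
            if (r.start.compareTo(expectedStart) > 0) {
                holes.add("[" + expectedStart + ", " + r.start + ")");
            }
            expectedStart = r.end;
            reachedEnd = r.end.isEmpty();
        }
        if (!reachedEnd) {
            holes.add("[" + expectedStart + ", <end of table>)");
        }
        return holes;
    }

    public static void main(String[] args) {
        // Parent region [b, m) is already offline, its daughters are not visible yet.
        List<Range> seenByClient = Arrays.asList(new Range("", "b"), new Range("m", ""));
        System.out.println(findHoles(seenByClient));
    }
}
```

A client that reads region boundaries in the window between the parent going offline and the daughters being added would see a list like the one in main, where the range between "b" and "m" is covered by no region.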

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
