hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ted Yu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-17400) TestHRegion#testWritesWhileScanning occasionally hangs in MVCC#.waitForRead()
Date Sun, 01 Jan 2017 02:46:58 GMT

    [ https://issues.apache.org/jira/browse/HBASE-17400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15790411#comment-15790411
] 

Ted Yu commented on HBASE-17400:
--------------------------------

Looking at waitForRead() :
{code}
    synchronized (readWaiters) {
      while (readPoint.get() < e.getWriteNumber()) {
        if (count % 100 == 0 && count > 0) {
          LOG.warn("STUCK: " + this + ", waiting for " + e.getWriteNumber());
        }
        count++;
        try {
          readWaiters.wait(10);
        } catch (InterruptedException ie) {
          // We were interrupted... finish the loop -- i.e. cleanup --and then
          // on our way out, reset the interrupt flag.
          interrupted = true;
        }
      }
    }
{code}
I wonder what the cleanup refers to.
I only see readWaiters.wait() call.

> TestHRegion#testWritesWhileScanning occasionally hangs in MVCC#.waitForRead()
> -----------------------------------------------------------------------------
>
>                 Key: HBASE-17400
>                 URL: https://issues.apache.org/jira/browse/HBASE-17400
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Ted Yu
>         Attachments: testHRegionWithInMemoryFlush-0e4866.out
>
>
> Looping TestHRegionWithInMemoryFlush with commit 0e48665641b16cd9b250503696b926a568063654
, I got the following error at iteration #33:
> {code}
> Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 600.163 sec <<<
FAILURE! - in org.apache.hadoop.hbase.regionserver.TestHRegionWithInMemoryFlush
> org.apache.hadoop.hbase.regionserver.TestHRegionWithInMemoryFlush  Time elapsed: 600.019
sec  <<< ERROR!
> org.junit.runners.model.TestTimedOutException: test timed out after 10 minutes
>   at java.lang.Object.wait(Native Method)
>   at org.apache.hadoop.hbase.regionserver.MultiVersionConcurrencyControl.waitForRead(MultiVersionConcurrencyControl.java:218)
>   at org.apache.hadoop.hbase.regionserver.MultiVersionConcurrencyControl.completeAndWait(MultiVersionConcurrencyControl.java:149)
>   at org.apache.hadoop.hbase.regionserver.HRegion.getNextSequenceId(HRegion.java:2731)
>   at org.apache.hadoop.hbase.regionserver.HRegion.internalPrepareFlushCache(HRegion.java:2446)
>   at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2342)
>   at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2313)
>   at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2303)
>   at org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:1600)
>   at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1505)
>   at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1455)
>   at org.apache.hadoop.hbase.HBaseTestingUtility.closeRegionAndWAL(HBaseTestingUtility.java:374)
>   at org.apache.hadoop.hbase.regionserver.TestHRegion.testWritesWhileScanning(TestHRegion.java:3985)
> {code}
> See attached test output for details.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message