hbase-issues mailing list archives

From "Ted Yu (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HBASE-12725) Insufficient wait in TestMultiParallel#testFlushCommitsWithAbort might lead to failed assertion on server count
Date Fri, 19 Dec 2014 04:26:14 GMT

     [ https://issues.apache.org/jira/browse/HBASE-12725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ted Yu updated HBASE-12725:
---------------------------
    Attachment: 12725-0.98.txt

> Insufficient wait in TestMultiParallel#testFlushCommitsWithAbort might lead to failed assertion on server count
> ---------------------------------------------------------------------------------------------------------------
>
>                 Key: HBASE-12725
>                 URL: https://issues.apache.org/jira/browse/HBASE-12725
>             Project: HBase
>          Issue Type: Test
>            Reporter: Ted Yu
>            Priority: Minor
>         Attachments: 12725-0.98.txt
>
>
> From https://builds.apache.org/job/hbase-0.98/750/testReport/org.apache.hadoop.hbase.client/TestMultiParallel/testFlushCommitsWithAbort/ :
> {code}
> 2014-12-19 01:49:57,877 FATAL [Thread-677] regionserver.HRegionServer(1886): ABORTING region server priapus.apache.org,50706,1418953791906: Aborting for tests
> java.lang.Exception: doTestFlushCommits
>   at org.apache.hadoop.hbase.client.TestMultiParallel.doTestFlushCommits(TestMultiParallel.java:268)
>   at org.apache.hadoop.hbase.client.TestMultiParallel.testFlushCommitsWithAbort(TestMultiParallel.java:240)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>   at java.lang.reflect.Method.invoke(Method.java:597)
>   at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
>   at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
>   at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
> 2014-12-19 01:49:57,877 FATAL [Thread-677] regionserver.HRegionServer(1894): RegionServer abort: loaded coprocessors are: [org.apache.hadoop.hbase.coprocessor.MultiRowMutationEndpoint]
> 2014-12-19 01:49:57,917 INFO  [Thread-677] client.TestMultiParallel(287): validating loaded data
> 2014-12-19 01:49:58,275 INFO  [Thread-677] client.TestMultiParallel(295): Count=1, Alive=priapus.apache.org,50706,1418953791906
> 2014-12-19 01:49:58,275 INFO  [Thread-677] client.TestMultiParallel(295): Count=2, Alive=priapus.apache.org,41103,1418953791972
> 2014-12-19 01:49:58,275 INFO  [Thread-677] client.TestMultiParallel(295): Count=3, Alive=priapus.apache.org,33781,1418953792016
> 2014-12-19 01:49:58,275 INFO  [Thread-677] client.TestMultiParallel(295): Count=4, Alive=priapus.apache.org,43381,1418953792058
> 2014-12-19 01:49:58,276 INFO  [Thread-677] client.TestMultiParallel(295): Count=5, Alive=priapus.apache.org,54398,1418953792103
> 2014-12-19 01:49:58,276 INFO  [Thread-677] client.TestMultiParallel(297): Count=5
> 2014-12-19 01:49:58,289 INFO  [RS:0;priapus:50706.periodicFlusher] hbase.Chore(93): RS:0;priapus:50706.periodicFlusher exiting
> 2014-12-19 01:49:58,290 INFO  [RS:0;priapus:50706.leaseChecker] regionserver.Leases(147): RS:0;priapus:50706.leaseChecker closing leases
> 2014-12-19 01:49:58,290 INFO  [RS:0;priapus:50706.leaseChecker] regionserver.Leases(150): RS:0;priapus:50706.leaseChecker closed leases
> 2014-12-19 01:49:58,291 INFO  [RS:0;priapus:50706] regionserver.CompactSplitThread(360): Waiting for Split Thread to finish...
> 2014-12-19 01:49:58,291 INFO  [RS:0;priapus:50706] regionserver.CompactSplitThread(360): Waiting for Merge Thread to finish...
> 2014-12-19 01:49:58,291 INFO  [RS:0;priapus:50706] regionserver.CompactSplitThread(360): Waiting for Large Compaction Thread to finish...
> 2014-12-19 01:49:58,291 INFO  [RS:0;priapus:50706] regionserver.CompactSplitThread(360): Waiting for Small Compaction Thread to finish...
> 2014-12-19 01:49:58,295 DEBUG [RS:0;priapus:50706-EventThread] zookeeper.ZooKeeperWatcher(312): regionserver:50706-0x14a603c969c0001, quorum=localhost:53936, baseZNode=/hbase Received ZooKeeper Event, type=NodeDeleted, state=SyncConnected, path=/hbase/replication/rs/priapus.apache.org,50706,1418953791906
> 2014-12-19 01:49:58,295 INFO  [RS:0;priapus:50706] client.HConnectionManager$HConnectionImplementation(1893): Closing zookeeper sessionid=0x14a603c969c0010
> 2014-12-19 01:49:58,302 DEBUG [RS:4;priapus:54398-EventThread] zookeeper.ZooKeeperWatcher(312): regionserver:54398-0x14a603c969c0004, quorum=localhost:53936, baseZNode=/hbase Received ZooKeeper Event, type=NodeDeleted, state=SyncConnected, path=/hbase/rs/priapus.apache.org,50706,1418953791906
> 2014-12-19 01:49:58,302 DEBUG [RS:2;priapus:33781-EventThread] zookeeper.ZooKeeperWatcher(312): regionserver:33781-0x14a603c969c0002, quorum=localhost:53936, baseZNode=/hbase Received ZooKeeper Event, type=NodeDeleted, state=SyncConnected, path=/hbase/rs/priapus.apache.org,50706,1418953791906
> 2014-12-19 01:49:58,302 DEBUG [RS:0;priapus:50706-EventThread] zookeeper.ZooKeeperWatcher(312): regionserver:50706-0x14a603c969c0001, quorum=localhost:53936, baseZNode=/hbase Received ZooKeeper Event, type=NodeDeleted, state=SyncConnected, path=/hbase/rs/priapus.apache.org,50706,1418953791906
> 2014-12-19 01:49:58,302 DEBUG [RS:3;priapus:43381-EventThread] zookeeper.ZooKeeperWatcher(312): regionserver:43381-0x14a603c969c0005, quorum=localhost:53936, baseZNode=/hbase Received ZooKeeper Event, type=NodeDeleted, state=SyncConnected, path=/hbase/rs/priapus.apache.org,50706,1418953791906
> 2014-12-19 01:49:58,302 DEBUG [main-EventThread] zookeeper.ZooKeeperWatcher(312): master:48016-0x14a603c969c0000, quorum=localhost:53936, baseZNode=/hbase Received ZooKeeper Event, type=NodeDeleted, state=SyncConnected, path=/hbase/rs/priapus.apache.org,50706,1418953791906
> 2014-12-19 01:49:58,302 DEBUG [RS:1;priapus:41103-EventThread] zookeeper.ZooKeeperWatcher(312): regionserver:41103-0x14a603c969c0003, quorum=localhost:53936, baseZNode=/hbase Received ZooKeeper Event, type=NodeDeleted, state=SyncConnected, path=/hbase/rs/priapus.apache.org,50706,1418953791906
> 2014-12-19 01:49:58,303 INFO  [main-EventThread] zookeeper.RegionServerTracker(118): RegionServer ephemeral node deleted, processing expiration [priapus.apache.org,50706,1418953791906]
> 2014-12-19 01:49:58,302 DEBUG [RS:0;priapus:50706-EventThread] zookeeper.ZooKeeperWatcher(312): regionserver:50706-0x14a603c969c0001, quorum=localhost:53936, baseZNode=/hbase Received ZooKeeper Event, type=NodeChildrenChanged, state=SyncConnected, path=/hbase/rs
> 2014-12-19 01:49:58,309 INFO  [RS:0;priapus:50706] regionserver.HRegionServer(1065): stopping server priapus.apache.org,50706,1418953791906; zookeeper connection closed.
> 2014-12-19 01:49:58,309 INFO  [RS:0;priapus:50706] regionserver.HRegionServer(1068): RS:0;priapus:50706 exiting
> {code}
> The abort of region server priapus.apache.org,50706,1418953791906 was initiated at 01:49:57,877.
> The check of the live region server count was performed at 01:49:58,275.
> However, the region server did not exit until 01:49:58,309, so it was still counted as alive (Count=5).
> Here is the code in doTestFlushCommits() of 0.98:
> {code}
>       while (liveRS.getRegionServer().getNumberOfOnlineRegions() != 0) {
>         Thread.sleep(10);
>       }
> {code}
> In the master branch, the sleep duration is 100 ms.
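
The timing gap in the log above (the live server count is checked before the aborted server has exited) is the kind of race that a bounded poll on the condition actually being asserted can avoid. The following is only a minimal sketch under stated assumptions: `WaitSketch` and `waitFor` are hypothetical names, the "server exit" is simulated with a timer, and no HBase API is used. It is not the attached patch.

```java
import java.util.function.BooleanSupplier;

public class WaitSketch {
  /**
   * Hypothetical helper: polls the condition every intervalMs until it
   * holds or timeoutMs has elapsed. Returns true if the condition held
   * in time, false on timeout or interruption.
   */
  public static boolean waitFor(long timeoutMs, long intervalMs, BooleanSupplier condition) {
    long deadline = System.nanoTime() + timeoutMs * 1_000_000L;
    while (!condition.getAsBoolean()) {
      if (System.nanoTime() - deadline >= 0) {
        return false; // timed out before the condition became true
      }
      try {
        Thread.sleep(intervalMs);
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt(); // preserve the interrupt status
        return false;
      }
    }
    return true;
  }

  public static void main(String[] args) {
    // Simulated stand-in for "the aborted region server has exited":
    // the condition becomes true roughly 50 ms after start.
    final long start = System.currentTimeMillis();
    boolean exited = waitFor(5_000, 10, () -> System.currentTimeMillis() - start >= 50);
    System.out.println("server exited in time: " + exited);
  }
}
```

In a real test the condition would be whatever the subsequent assertion depends on (for example, the live server count having dropped), so the assertion runs only after that state is reached or the timeout reports a clear failure.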



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
