hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andrey Stepachev (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-13574) Broken TestHBaseFsck in master
Date Mon, 27 Apr 2015 15:21:39 GMT

    [ https://issues.apache.org/jira/browse/HBASE-13574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14514275#comment-14514275
] 

Andrey Stepachev commented on HBASE-13574:
------------------------------------------

It seems that test is broken due of the new create retry policy in hadoop 2.6. 
Namenode proxy now created with custom RetryPolicy for AlreadyBeingCreatedException which
is implies timeout on this operations up to HdfsConstants.LEASE_SOFTLIMIT_PERIOD (60seconds).
So we should expect that create can execute as minimum as this constant if file already exists.
So for now we can fix that by hardcoding greater timeouts to HBaseFsck (for now they 
are 15 seconds in test and 30seconds in HbaseFsck itself).
This change should not break production usage of HBaseFsck, except of it will take longer
to
find out that other instance of HbaseFsck is running.

> Broken TestHBaseFsck in master
> ------------------------------
>
>                 Key: HBASE-13574
>                 URL: https://issues.apache.org/jira/browse/HBASE-13574
>             Project: HBase
>          Issue Type: Bug
>          Components: test
>    Affects Versions: 2.0.0
>            Reporter: Andrey Stepachev
>            Priority: Critical
>
> Got following exception and it reproducible (I can see it in recent tests runs from other
patches).
> {noformat}
> Running org.apache.hadoop.hbase.util.TestHBaseFsck
> Tests run: 51, Failures: 0, Errors: 1, Skipped: 1, Time elapsed: 348.628 sec <<<
FAILURE! - in org.apache.hadoop.hbase.util.TestHBaseFsck
> testParallelWithRetriesHbck(org.apache.hadoop.hbase.util.TestHBaseFsck)  Time elapsed:
30.052 sec  <<< ERROR!
> java.util.concurrent.ExecutionException: java.io.IOException: Duplicate hbck - Abort
> 	at java.util.concurrent.FutureTask.report(FutureTask.java:122)
> 	at java.util.concurrent.FutureTask.get(FutureTask.java:188)
> 	at org.apache.hadoop.hbase.util.TestHBaseFsck.testParallelWithRetriesHbck(TestHBaseFsck.java:634)
> Caused by: java.io.IOException: Duplicate hbck - Abort
> 	at org.apache.hadoop.hbase.util.HBaseFsck.connect(HBaseFsck.java:473)
> 	at org.apache.hadoop.hbase.util.hbck.HbckTestingUtil.doFsck(HbckTestingUtil.java:53)
> 	at org.apache.hadoop.hbase.util.hbck.HbckTestingUtil.doFsck(HbckTestingUtil.java:43)
> 	at org.apache.hadoop.hbase.util.hbck.HbckTestingUtil.doFsck(HbckTestingUtil.java:38)
> 	at org.apache.hadoop.hbase.util.TestHBaseFsck$2RunHbck.call(TestHBaseFsck.java:625)
> 	at org.apache.hadoop.hbase.util.TestHBaseFsck$2RunHbck.call(TestHBaseFsck.java:621)
> 	at java.util.concurrent.FutureTask.run(FutureTask.java:262)
> 	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> 	at java.lang.Thread.run(Thread.java:745)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message