hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "stack (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HBASE-8888) Tweak retry settings some more, *some more*
Date Mon, 08 Jul 2013 21:47:50 GMT

     [ https://issues.apache.org/jira/browse/HBASE-8888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

stack updated HBASE-8888:
-------------------------

    Attachment: 8888v2.txt

Main change is upping retries from 14 to 31 and a change of the
RETRY_BACKUP array so we ramp up quickly to retrying every ten seconds.

M hbase-client/src/main/java/org/apache/hadoop/hbase/client/ServerCallable.java
  Print out elapsed time over all retries.  Helps figuring where we
  are time-wise retrying.

M hbase-client/src/test/java/org/apache/hadoop/hbase/client/TestClientNoCluster.java
  Utility for checking our retry.  Off by default since it a 'failing' test.

M hbase-client/src/test/resources/hbase-site.xml
M hbase-server/src/test/resources/hbase-site.xml
  Rely on default retries rather than have custom ones for tests only.

M hbase-common/src/main/java/org/apache/hadoop/hbase/HConstants.java
  Change RETRY_BACKUP so it ramps up quickly to 100 * pause.  Set default
  retries to be 31.
                
> Tweak retry settings some more, *some more*
> -------------------------------------------
>
>                 Key: HBASE-8888
>                 URL: https://issues.apache.org/jira/browse/HBASE-8888
>             Project: HBase
>          Issue Type: Bug
>            Reporter: stack
>            Assignee: stack
>             Fix For: 0.95.2
>
>         Attachments: 8888.txt, 8888v2.txt
>
>
> Follow on from hbase-8776.
> Need to fix retries and timeouts.  We cut them down so much hbase-it tests fail.
> From https://issues.apache.org/jira/browse/HBASE-8776?focusedCommentId=13698762&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13698762
@nkeywal says:
> {code}
> I would like to change
> hbase.client.retries.number -> 30 (instead of 14 or 20 today)
> hbase.client.pause -> 500 (instead of 100 or 1000 today).
> Context: see HBASE-6295.
> As well, would it make sense to remove all the hbase-site.xml and hbase-defaults.xml
to rely only on the defaults in the code. This would trigger another set of issues, as sometimes
the defaults are duplicated and different. But these are bugs as well. Imho, this duplication
is confusing and it leads to unreliable behavior as we don't really know what are the setting
actually used.
> {code}
> Regards removing hbase-site.xml from everywhere to rely on defaults in code, over in
hbase-8776 I tried removing them and way too many tests failed.  Looks like it'd be tough
removing them.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message