hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hadoop QA (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-12450) Unbalance chaos monkey might kill all region servers without starting them back
Date Sat, 08 Nov 2014 00:22:37 GMT

    [ https://issues.apache.org/jira/browse/HBASE-12450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14203001#comment-14203001
] 

Hadoop QA commented on HBASE-12450:
-----------------------------------

{color:red}-1 overall{color}.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12680323/HBASE-12450-0.98.patch
  against trunk revision .
  ATTACHMENT ID: 12680323

    {color:green}+1 @author{color}.  The patch does not contain any @author tags.

    {color:green}+1 tests included{color}.  The patch appears to include 6 new or modified
tests.

    {color:red}-1 patch{color}.  The patch command could not apply the patch.

Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/11617//console

This message is automatically generated.

> Unbalance chaos monkey might kill all region servers without starting them back
> -------------------------------------------------------------------------------
>
>                 Key: HBASE-12450
>                 URL: https://issues.apache.org/jira/browse/HBASE-12450
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Virag Kothari
>            Assignee: Virag Kothari
>            Priority: Minor
>             Fix For: 2.0.0, 0.98.8, 0.99.2
>
>         Attachments: HBASE-12450-0.98.patch, HBASE-12450.patch
>
>
> UnbalanceKillAndRebalanceAction does kill, balance and then start of region servers.
But if the balance fails exception is thrown causing the region servers to not start. For
me, the balance always kept on failing with socket timeout (default 1 min) as master runs
one iteration of balance for 5 mins (default config). Eventually all servers are killed but
never started back.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message