hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Anoop John <anoop.hb...@gmail.com>
Subject Re: Handling regionserver crashes in production cluster
Date Thu, 06 Jun 2013 04:18:23 GMT
How many total RS in the cluster?  You mean u can not do any operation on
other regions in the live clusters?  It should not happen..  Is it so
happening that the client ops are targetted at the regions which were in
the dead RS( and in transition now)?   Can u have a closer look and see?
If not pls check the RS threads were they are getting blocked.

-Anoop-

On Wed, Jun 5, 2013 at 10:50 PM, kiran <kiran.sarvabhotla@gmail.com> wrote:

> Dear All,
>
> We have production cluster that runs on hbase 0.94.1. The issue we are
> facing is whenever one regionserver goes down, the cluster becomes
> unresponsive until all the regions are allocated to another
> regionserver(s). The transition is taking about 3-5 mins and during this
> time we are unable to any do client operation on the cluster.
>
> Is there any way we can make the transition to run in background ?
>
> Also, it is acceptable for us if the client operations such as scan or get
> does not work on the rowkeys of regions in transition. But, they are not
> working on the entire cluster until all the regions are moved out of
> transition. We can't afford 3-5 minutes of downtime.
>
> --
> Thank you
> Kiran Sarvabhotla
>
> -----Even a correct decision is wrong when it is taken late
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message