hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Yu <yuzhih...@gmail.com>
Subject Re: HBase region server failure issues
Date Mon, 14 Apr 2014 21:41:47 GMT
There is on-going effort to address this issue.

See the following:
HBASE-8610 Introduce interfaces to support MultiWAL
HBASE-10378 Divide HLog interface into User and Implementor specific
interfaces

Cheers


On Mon, Apr 14, 2014 at 1:47 PM, Claudiu Soroiu <csoroiu@gmail.com> wrote:

> Hi all,
>
> My name is Claudiu Soroiu and I am new to hbase/hadoop but not new to
> distributed computing in FT/HA environments and I see there are a lot of
> issues reported related to the region server failure.
>
> The main problem I see it is related to recovery time in case of a node
> failure and distributed log splitting. After some tunning I managed to
> reduce it to 8 seconds in total and for the moment it fits the needs.
>
> I have one question: *Why there is only one WAL file per region server and
> not one WAL per region itself? *
> I haven't found the exact answer anywhere, that's why i'm asking on this
> list and please point me to the right direction if i missed the list.
>
> My point is that eliminating the need of splitting a log in case of failure
> reduces the downtime for the regions and the only delay that we will see
> will be related to transferring data over network to the region servers
> that will take over the failed regions.
> This is feasible only if having multiple WAL's per Region Server does not
> affect the overall write performance.
>
> Thanks,
> Claudiu
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message