hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Yu <yuzhih...@gmail.com>
Subject Re: One Region Server fails - all M/R jobs crash.
Date Fri, 22 Nov 2013 15:17:43 GMT
Can you pastebin snippet of:
1. task logs which show failure
2. region server log shortly before the crash

Thanks


On Fri, Nov 22, 2013 at 7:14 AM, David Koch <ogdude@googlemail.com> wrote:

> Hello,
>
> We experience reliability problems when running M/R jobs over HBase tables.
> Specifically, it suffices for one Region Server to crash in order to fail
> all M/R jobs.
>
> My guess is that this is not normal with a replication factor of 3.
>
> The HBase version is 0.94.6 installed as part of of Cloudera 4.4. HBase
> settings are pre-sets. Cluster size is 30 machines.
>
> What steps can I follow to improve the situation?
>
> Thank you,
>
> /David
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message