hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Yu <yuzhih...@gmail.com>
Subject Re: One Region Server fails - all M/R jobs crash.
Date Fri, 22 Nov 2013 17:27:32 GMT
Attachment didn't go through. 

Can you pastebin their contents ?

Thanks

On Nov 23, 2013, at 12:55 AM, David Koch <ogdude@googlemail.com> wrote:

> Sorry for the previous message, I attach the equired log files.
> 
> Regards,
> 
> David
> 
> 
> On Fri, Nov 22, 2013 at 5:53 PM, David Koch <ogdude@googlemail.com> wrote:
>> 
>> 
>> 
>> On Fri, Nov 22, 2013 at 4:17 PM, Ted Yu <yuzhihong@gmail.com> wrote:
>>> Can you pastebin snippet of:
>>> 1. task logs which show failure
>>> 2. region server log shortly before the crash
>>> 
>>> Thanks
>>> 
>>> 
>>> On Fri, Nov 22, 2013 at 7:14 AM, David Koch <ogdude@googlemail.com> wrote:
>>> 
>>> > Hello,
>>> >
>>> > We experience reliability problems when running M/R jobs over HBase tables.
>>> > Specifically, it suffices for one Region Server to crash in order to fail
>>> > all M/R jobs.
>>> >
>>> > My guess is that this is not normal with a replication factor of 3.
>>> >
>>> > The HBase version is 0.94.6 installed as part of of Cloudera 4.4. HBase
>>> > settings are pre-sets. Cluster size is 30 machines.
>>> >
>>> > What steps can I follow to improve the situation?
>>> >
>>> > Thank you,
>>> >
>>> > /David
>>> >
> 

Mime
  • Unnamed multipart/alternative (inline, 7-Bit, 0 bytes)
View raw message