hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From David Koch <ogd...@googlemail.com>
Subject Re: One Region Server fails - all M/R jobs crash.
Date Fri, 22 Nov 2013 17:35:56 GMT
Here you go:

Task log: http://pastebin.com/VePTLHEk
Region Server log: http://pastebin.com/iu8y0VYL


On Fri, Nov 22, 2013 at 6:27 PM, Ted Yu <yuzhihong@gmail.com> wrote:

> Attachment didn't go through.
>
> Can you pastebin their contents ?
>
> Thanks
>
> On Nov 23, 2013, at 12:55 AM, David Koch <ogdude@googlemail.com> wrote:
>
> > Sorry for the previous message, I attach the equired log files.
> >
> > Regards,
> >
> > David
> >
> >
> > On Fri, Nov 22, 2013 at 5:53 PM, David Koch <ogdude@googlemail.com>
> wrote:
> >>
> >>
> >>
> >> On Fri, Nov 22, 2013 at 4:17 PM, Ted Yu <yuzhihong@gmail.com> wrote:
> >>> Can you pastebin snippet of:
> >>> 1. task logs which show failure
> >>> 2. region server log shortly before the crash
> >>>
> >>> Thanks
> >>>
> >>>
> >>> On Fri, Nov 22, 2013 at 7:14 AM, David Koch <ogdude@googlemail.com>
> wrote:
> >>>
> >>> > Hello,
> >>> >
> >>> > We experience reliability problems when running M/R jobs over HBase
> tables.
> >>> > Specifically, it suffices for one Region Server to crash in order to
> fail
> >>> > all M/R jobs.
> >>> >
> >>> > My guess is that this is not normal with a replication factor of 3.
> >>> >
> >>> > The HBase version is 0.94.6 installed as part of of Cloudera 4.4.
> HBase
> >>> > settings are pre-sets. Cluster size is 30 machines.
> >>> >
> >>> > What steps can I follow to improve the situation?
> >>> >
> >>> > Thank you,
> >>> >
> >>> > /David
> >>> >
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message