hbase-user mailing list archives

From "Jean-Daniel Cryans" <jdcry...@apache.org>
Subject Re: low performance on hadoop
Date Fri, 07 Nov 2008 14:46:51 GMT
Jarod,

Some information is missing. Do you have only 1 table? How did you try
to count the rows the first time? If you are using only 1 client,
scanning is sequential, so it's normal to see only 1 region server taking
all the hits at any given time. Maybe try the RowCounter mapreduce job
provided with HBase?
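
To illustrate the point about a single client, here is a rough toy model in Python. It is not the HBase client API: the `regions` dict, the server names, and both functions are hypothetical and only mimic the access pattern. A single scanner walks the regions one after another, so only one server is busy at a time, while a job with one task per region (which is roughly what a mapreduce row count does) lets every server count its own region concurrently.

```python
from concurrent.futures import ThreadPoolExecutor

# Toy stand-in for a table: server name -> row keys of the region it hosts.
# Hypothetical data; nothing here talks to HBase.
regions = {
    "S1": ["row-000", "row-001", "row-002"],
    "S2": ["row-100", "row-101"],
    "S3": ["row-200", "row-201", "row-202", "row-203"],
}


def count_sequential(regions):
    """One client scanning in key order.

    Regions are visited one after another, so at any moment
    only a single server is answering requests.
    """
    total = 0
    busy_order = []  # which server was serving, in order
    for server, rows in regions.items():
        busy_order.append(server)
        total += len(rows)
    return total, busy_order


def count_parallel(regions):
    """One task per region, with partial counts summed at the end.

    All servers can work at the same time because each task
    only touches its own region.
    """
    with ThreadPoolExecutor(max_workers=len(regions)) as pool:
        partial_counts = list(pool.map(len, regions.values()))
    return sum(partial_counts)


if __name__ == "__main__":
    total, busy_order = count_sequential(regions)
    print("sequential:", total, "servers hit in order:", busy_order)
    print("parallel:  ", count_parallel(regions))
```

Both approaches return the same count; the difference is only in how the load is spread, which would explain a monitor showing requests on one server and 0 on the rest.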

Also, try upgrading to HBase 0.18.1, which runs on Hadoop 0.18; Hadoop 0.18 is a bit faster.

J-D

On Fri, Nov 7, 2008 at 3:21 AM, Jarod Feng <jarodf@gmail.com> wrote:

>
> Hi,
> I'm using HBase 0.2.1 + Hadoop 0.17 with 11 client servers and 1 master server.
> I crawled some data and saved it to HBase with map & reduce.
> After that, I tried to count the total number of some of the data.
> It is too slow. As you can see in the monitor, 1 of the clients has a high
> request count, but the others are at 0.
>
> S1:60020 1226025034218 requests: 0 regions: 5
> S2:60020 1226025034364 requests: 0 regions: 5
> S3:60020 1226025033874 requests: 0 regions: 5
> S4:60020 1226025035074 requests: 5085 regions: 4
> S5:60020 1226025034712 requests: 0 regions: 5
> S6:60020 1226025034716 requests: 0 regions: 5
> S7:60020 1226025034280 requests: 0 regions: 4
> S8:60020 1226025034130 requests: 0 regions: 5
> S9:60020 1226025033726 requests: 0 regions: 5
> S10:60020 1226025034539 requests: 0 regions: 5
> S11:60020 1226025034528 requests: 0 regions: 4
>
> I don't know the reason. When I use the shell to count, or a single
> process without map-reduce, it's faster.
>
> I have used Hadoop for many months, but this is my first time developing
> on HBase.
>
>
> Thanks,
> --
> View this message in context:
> http://www.nabble.com/low-performance-on-hadoop-tp20376261p20376261.html
> Sent from the HBase User mailing list archive at Nabble.com.
>
>
