hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dai, Kevin" <yun...@ebay.com>
Subject RE: ResultScanner performance
Date Wed, 27 Aug 2014 03:18:56 GMT
Hi, Ted

We have a cluster of 48 machines and at least 100T data(which is still increasing).
The problem is that we have a lot of row keys (about tens of thousands ) to query in the meantime
and we don't fetch all the data at once, instead we fetch them when needed,
so we may hold tens of thousands ResultScanner in the meantime.
I want to know whether it will hurt the performance and network resources and if so, is there
any way to solve it?

Best regards,
Kevin.
-----Original Message-----
From: Ted Yu [mailto:yuzhihong@gmail.com] 
Sent: 2014年8月26日 16:49
To: user@hbase.apache.org
Cc: user@hbase.apache.org; Huang, Jianshi
Subject: Re: ResultScanner performance

Can you give a bit more detail ?
What size is the cluster / dataset ?
What problem are you solving ?
Would using coprocessor help reduce the usage of ResultScanner ?

Cheers

On Aug 26, 2014, at 12:13 AM, "Dai, Kevin" <yundai@ebay.com> wrote:

> Hi, everyone
> 
> My application will hold tens of thousands of ResultScanner to get Data. Will it hurt
the performance and network resources?
> If so, is there any way to solve it?
> Thanks,
> Kevin.
Mime
View raw message