hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alex Baranau <alex.barano...@gmail.com>
Subject Re: Parallel scans
Date Wed, 01 Aug 2012 13:23:31 GMT
> Is there a way to execute multiple scans in parallel like get?

I guess the Q is can we (and does it makes sense) to execute multiple scans
in parallel, e.g. in multiple threads inside the client. The answer is yes,
you can do it and it makes sense: HBase is likely to be able to process
much more requests in parallel than you have clients (depends on your
clients number of course, but I assume you don't have more that several,
incl. MR jobs).

Alex Baranau
------
Sematext :: http://blog.sematext.com/ :: Hadoop - HBase - ElasticSearch -
Solr

On Tue, Jul 31, 2012 at 3:27 PM, Tom Brown <tombrown52@gmail.com> wrote:

> I think you could do it manually by looking up all the different
> regions and starting a separate scan for each region. Not quite as
> handy as the built-in multi get, but essentially the same.
>
> Of course, that leaves the question of processing-- If you're
> processing it in a single-threaded environment, HBase is unlikely to
> be the bottleneck. If your sending each scan to multiple processors,
> this could be a significant speedup.
>
> --Tom
>
> On Mon, Jul 30, 2012 at 11:34 PM, Bertrand Dechoux <dechouxb@gmail.com>
> wrote:
> > Hi,
> >
> > Are you talking about as coprocessor or MapReduce input? If it is the
> first
> > then it is up to you (the client). If it is the latter I am not sure that
> > -if scans were changed to be parallel (assuming they are sequential now)-
> > the whole job would be noticeably faster. But I am interested in an
> answer
> > too.
> >
> > Regards
> >
> > Bertrand
> >
> > On Tue, Jul 31, 2012 at 2:56 AM, Mohit Anchlia <mohitanchlia@gmail.com
> >wrote:
> >
> >> Is there a way to execute multiple scans in parallel like get?
> >>
> >
> >
> >
> > --
> > Bertrand Dechoux
>



-- 
Alex Baranau
------
Sematext :: http://blog.sematext.com/ :: Hadoop - HBase - ElasticSearch -
Solr

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message