hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Stack <st...@duboce.net>
Subject Re: Accumulo iterators in HBase
Date Fri, 04 Jul 2014 21:54:48 GMT
On Wed, Jul 2, 2014 at 12:07 AM, Carl Austin <carl.austin@gmail.com> wrote:

> Thanks for the time to look and comment and glad it sounds interesting,
> The reason I started on this was that I'm using Accumulo and want to make
> an application usable on both HBase and Accumulo with the same codebase. I
> do a lot of aggregations on data and I feel the Accumulo iterator mechanism
> is superior for this use case; it's one of the main reasons I went with
> Accumulo and one of the only remaining major differences between the two
> applications now that HBase has implemented cell level ACLs.
> For example, as I am ingesting a main table of data I am creating many
> other question focused tables that keep answers like how many times did I
> see combinations of values, when was the last time I saw combinations
> together, how many distinct values where in this field for each combination
> (using probabilistic counting of course) and many more. All of these things
> are well suited to Accumulo iterators for performance at scale because of
> how they run at compaction time across key/values that are already being
> read at that point, rather than having to update the answers to these
> questions on every single insert.
> This use case won't be for everyone, but the iterator mechanism is pretty
> neat, powerful and a real differentiator in Accumulo (of course there are
> many differentiators in HBase too!).

Thank you for sharing your experience.  I'm watching your repo.  Feel free
to ping me off-list if you want an opinion on how to hbase it or if you
want a review.

Thanks Carl,

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message