accumulo-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jason Trost <jason.tr...@gmail.com>
Subject Re: Accumulo Hive Storage Handler
Date Sat, 04 May 2013 17:16:43 GMT
Hey Brian,

This is pretty cool.  Just out of curiosity do you have any performance
numbers for this compared to Hive over files or other datastores?  I am
curious how much the iterators speed things with Predicate pushdowns.

Thanks,

--Jason



On Fri, May 3, 2013 at 11:30 PM, Brian Femiano <bfemiano@gmail.com> wrote:

> Use Hive to directly and efficiently query data stored in Accumulo tables.
>
> See the Getting Started Guide and required AUX_JARS list. The homepage also
> lists the current limitations.
>
> I've submitted a patch ACCUMULO-143 to get this directly into Accumulo
> trunk, but for now people can experiment with it at:
> https://github.com/bfemiano/accumulo-hive-storage-manager.
>
> The CREATE EXTERNAL TABLE keywords allows Hive to create a metastore entry
> for the Accumulo table, which 'theoretically' suggests you could use
> Cloudera Impala directly with Accumulo. I have not tested this though.
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message