accumulo-notifications mailing list archives

From "Carl Austin (JIRA)" <>
Subject [jira] [Commented] (ACCUMULO-143) Accumulo Hive
Date Wed, 18 Jun 2014 08:18:05 GMT


Carl Austin commented on ACCUMULO-143:

Thanks [~elserj], great to know that there is going to be more progress on this. Coincidentally,
I think I've recently got INSERT working too, also mostly untested, but I'll take a look at
what you've done for comparison's sake when I get a minute.
I noticed that this patch removes a lot of the parsing from the record reader, as well as
the blank initialisation, something I also had to do to get any kind of performance at scale.
That, combined with the column fetch, has significantly improved overall performance
(a count(col) on 11 million values is down from tens of minutes to a couple of minutes on my
small test cluster). Over the coming week or so I'm going to look at whether I can eke any
more speed out of it, and test how it compares to plain file-based external tables.
If I find anything more I'll let you know.

> Accumulo Hive
> -------------
>                 Key: ACCUMULO-143
>                 URL:
>             Project: Accumulo
>          Issue Type: Task
>          Components: contrib
>    Affects Versions: 1.6.0
>            Reporter: Keith Turner
>         Attachments: ACCUMULO-143.patch
> Need to look into adding support for Accumulo to Hive

This message was sent by Atlassian JIRA
