hadoop-pig-dev mailing list archives

From "Vincent BARAT (JIRA)" <j...@apache.org>
Subject [jira] Created: (PIG-1029) HBaseStorage is way too slow to be usable
Date Tue, 20 Oct 2009 08:19:59 GMT
HBaseStorage is way too slow to be usable
-----------------------------------------

                 Key: PIG-1029
                 URL: https://issues.apache.org/jira/browse/PIG-1029
             Project: Pig
          Issue Type: Bug
    Affects Versions: 0.4.0
            Reporter: Vincent BARAT


I have run a set of benchmarks on the HBaseStorage loader, using Pig 0.4.0, HBase 0.20.0
(with the patch referred to in https://issues.apache.org/jira/browse/PIG-970) and Hadoop 0.20.0.

In these benchmarks, the HBaseStorage loader is roughly 10x slower than the PigStorage loader.

To work around this limitation, I had to read my HBase tables, write them to a Hadoop file, and
then use this file as input for my subsequent computations.

I am reporting this bug for the record; I will try to see if I can optimise this a bit.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

