hadoop-pig-dev mailing list archives

From "Vincent BARAT (JIRA)" <j...@apache.org>
Subject [jira] Resolved: (PIG-1029) HBaseStorage is way too slow to be usable
Date Thu, 18 Mar 2010 22:40:27 GMT

     [ https://issues.apache.org/jira/browse/PIG-1029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vincent BARAT resolved PIG-1029.

    Resolution: Duplicate

Ok, I see that some work on HBaseStorage is in progress and is tracked on other issues.
Maybe the caching could be set in the second parameter of the future HBaseStorage() constructor...

> HBaseStorage is way too slow to be usable
> -----------------------------------------
>                 Key: PIG-1029
>                 URL: https://issues.apache.org/jira/browse/PIG-1029
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.4.0
>            Reporter: Vincent BARAT
> I have performed a set of benchmarks on the HBaseStorage loader, using Pig 0.4.0 and HBase
> 0.20.0 (using the patch referred to in https://issues.apache.org/jira/browse/PIG-970) and Hadoop
> The HBaseStorage loader is basically 10x slower than the PigStorage loader.
> To bypass this limitation, I had to read my HBase tables, write them to a Hadoop file,
> and then use this file as input for my subsequent computations.
> I am reporting this bug for the record; I will try to see if I can optimise this a bit.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
