pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dmitriy V. Ryaboy (JIRA)" <j...@apache.org>
Subject [jira] Resolved: (PIG-1828) HBaseStorage has problems with processing multiregion tables
Date Sat, 05 Mar 2011 23:45:46 GMT

     [ https://issues.apache.org/jira/browse/PIG-1828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Dmitriy V. Ryaboy resolved PIG-1828.

    Resolution: Fixed

Fixed as part of PIG-1680.

> HBaseStorage has problems with processing multiregion tables
> ------------------------------------------------------------
>                 Key: PIG-1828
>                 URL: https://issues.apache.org/jira/browse/PIG-1828
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.8.0
>         Environment: Hadoop 0.20.2, Hbase 0.20.6, Distributed mode
>            Reporter: Lukas
>            Assignee: Dmitriy V. Ryaboy
> As brought up in the pig user mailing list (http://www.mail-archive.com/user%40pig.apache.org/msg00606.html)
Pig does sometime not scan the full HBase table.
> It seems that HBaseStorage has problems scanning large tables. It issues just one mapper
job instead of one mapper job per table region.
> Ian Stevens, who brought this issue up in the mailing list, attached a script to reproduce
the problem (https://gist.github.com/766929).
> However, in my case, the problem only occurred, after the table was split into more than
one regions.

This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message