pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dmitriy V. Ryaboy (JIRA)" <j...@apache.org>
Subject [jira] Commented: (PIG-1699) HBaseStorage option -gte resolves to CompareOp.GREATER instead of CompareOp.GREATER_OR_EQUAL
Date Sat, 08 Jan 2011 21:48:46 GMT

    [ https://issues.apache.org/jira/browse/PIG-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12979212#action_12979212
] 

Dmitriy V. Ryaboy commented on PIG-1699:
----------------------------------------

This is actually intentional. 

End keys reported by HBase are not inclusive -- an hbase region contains row keys in the range
[start_key, end_key).  So if endkey == gtevalue, the gte value is NOT in this region.  
This is documented on this wiki page: http://wiki.apache.org/hadoop/Hbase/HbaseArchitecture
.

Did you encounter a situation in which a region was filtered out incorrectly that required
this fix?

-D

> HBaseStorage option -gte resolves to CompareOp.GREATER instead of CompareOp.GREATER_OR_EQUAL
> --------------------------------------------------------------------------------------------
>
>                 Key: PIG-1699
>                 URL: https://issues.apache.org/jira/browse/PIG-1699
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.8.0, 0.9.0
>            Reporter: Jeremy Hinegardner
>            Priority: Minor
>         Attachments: PIG-1699.patch
>
>
> When using HBaseStorage, and using  '-gte'  option, this is passed to  HTableInputFormat,
which then uses CompareOp.GREATER instead of CompareOp.GREATER_OR_EQUAL for split decisions.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message