hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Owen O'Malley (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-5528) Binary partitioner
Date Fri, 20 Mar 2009 20:08:50 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-5528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12684013#action_12684013
] 

Owen O'Malley commented on HADOOP-5528:
---------------------------------------

I understand now. right is defined as bytes from the right to ignore. Unfortunately that means
you *can't* use it to pick up bytes 4-8 if you don't know the length. How about using python
style offsets where negative numbers means count from the right. That will allow a lot more
flexibility.

0 = start of bytes
1 = after first byte
-2 = before last byte
-1 = end of bytes

So left=4, right=8 would use bytes 4-8 from the right.
Left=-5, right=-1 would use the last 4 bytes.

> Binary partitioner
> ------------------
>
>                 Key: HADOOP-5528
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5528
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: mapred
>            Reporter: Klaas Bosteels
>            Assignee: Klaas Bosteels
>         Attachments: HADOOP-5528.patch
>
>
> It would be useful to have a {{BinaryPartitioner}} that partitions {{BinaryComparable}}
keys by hashing a configurable part of the bytes array corresponding to each key.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message