hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ZhuGuanyin (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HADOOP-5779) KeyFieldBasedPartitioner would lost data if specifed field not exist, and it should encode free not only support utf8
Date Thu, 07 May 2009 06:50:30 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-5779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

ZhuGuanyin updated HADOOP-5779:
-------------------------------

        Fix Version/s:     (was: 0.20.1)
                       0.21.0
          Description: 
1) Currently,  KeyFieldBasedPartitioner only support utf8 encoded recored,  we should use
text or byteswriteable data types.

2) when using KeyFieldBasedPartitioner, if the record doesn't contain the specified field,
the endChar would equal with array.length, which throw ArrayOutOfIndex exception, losting
that record!

  was:
1) Currently,  KeyFieldBasedPartitioner only support utf8 encoded recored,  we should use
text or byteswriteable data types.
2) when using KeyFieldBasedPartitioner, if the record doesn't contain the specified field,
the endChar would equal with array.length, which throw ArrayOutOfIndex exception, losting
that record!

    Affects Version/s: 0.20.0
              Summary: KeyFieldBasedPartitioner would lost data if specifed field not exist,
and it should encode free not only support utf8  (was: KeyFieldBasedPartitioner should encode
free and handle ArrayOutOfIndex exception!)

> KeyFieldBasedPartitioner would lost data if specifed field not exist, and it should encode
free not only support utf8
> ---------------------------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-5779
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5779
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.20.0
>            Reporter: ZhuGuanyin
>             Fix For: 0.21.0
>
>         Attachments: encode-free-KeyFieldBasedPartitioner.patch
>
>
> 1) Currently,  KeyFieldBasedPartitioner only support utf8 encoded recored,  we should
use text or byteswriteable data types.
> 2) when using KeyFieldBasedPartitioner, if the record doesn't contain the specified field,
the endChar would equal with array.length, which throw ArrayOutOfIndex exception, losting
that record!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message