hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "William McNeill (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAPREDUCE-2520) InputSampler.RandomSampler only accepts Text keys
Date Thu, 19 May 2011 21:57:48 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13036518#comment-13036518
] 

William McNeill commented on MAPREDUCE-2520:
--------------------------------------------

We're running from a tarball distribution at my organization.

The src softlink in the main Hadoop directory points to a "cloudera/hadoop-0.20.2+320" directory.
 In the 
root directory there are several jar files with names like hadoop-*-0.20.2+320.jar.

> InputSampler.RandomSampler only accepts Text keys
> -------------------------------------------------
>
>                 Key: MAPREDUCE-2520
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2520
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>            Reporter: William McNeill
>            Priority: Minor
>
> I want to do a total sort on some data whose key type is Writable but not Text.  I wrote
an InputSampler.RandomSampler object following the example in the "Total Sort" section of
Hadoop: The Definitive Guide.  When I call InputSampler.writePartitionFile() I get a runtime
class cast exception because my key type cannot be cast to Text.  Specifically the issue seems
to be the following section of InputSampler.getSample():
>     K key = reader.getCurrentKey();
>     ....
>     Text keyCopy = WritableUtils.<Text>clone((Text)key, job.getConfiguration());
> You can only use a RandomSampler on data with Text keys despite the fact that InputSampler
takes <Key, Value> generic parameters.
> InputSampler.getSample() should be changed to cast the key to type K instead of type
Text.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message