hadoop-mapreduce-dev mailing list archives

From "Robert Joseph Evans (Resolved) (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (MAPREDUCE-3388) Streaming task with special char gets wrong output
Date Thu, 08 Dec 2011 22:15:40 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-3388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Joseph Evans resolved MAPREDUCE-3388.
--------------------------------------------

    Resolution: Invalid

As it turns out, this was an issue with our tests that was masked by a small bug in streaming.
I have filed MAPREDUCE-3521 to address the streaming bug.
                
> Streaming task with special char gets wrong output 
> ---------------------------------------------------
>
>                 Key: MAPREDUCE-3388
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3388
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/streaming, mrv2
>    Affects Versions: 0.23.0, 0.24.0
>            Reporter: Robert Joseph Evans
>            Assignee: Robert Joseph Evans
>            Priority: Critical
>
> In 0.20.204:
> {noformat}
> hadoop jar $HADOOP_HOME/hadoop-streaming.jar '-Dmapreduce.job.acl-view-job=*' -files 'file:///tmp/InputDir#testlink!@$&*()-_+=' -input input.txt -mapper 'xargs cat' -reducer cat -output output -jobconf mapred.job.name=streamingTest-1000 -jobconf 'mapreduce.job.acl-view-job=*'
> {noformat}
> Output:
> {noformat}
> hadoop fs -cat output/*
> and-so-is-the-c#
> c++-also-supports-object-oriented-concepts
> hadoop-apache-org-core  
> hadoop-streaming
> hod-is-the-part-of-hadoop
> it-is-residing-on-apache-under-repos-asf
> java-an object-oriented-language
> smalltalk-is-also-object-oriented-language
> streaming-is-also-part-of-it
> {noformat}
> For 0.23:
> {noformat}
> hadoop jar $HADOOP_MAPRED_HOME/hadoop-streaming.jar -Dmapreduce.job.acl-view-job=* -files 'file:///tmp/InputDir#testlink!@$&*()-_+=' -input input.txt  -mapper 'xargs cat' -reducer cat -output output -jobconf mapred.job.name=streamingTest-1000 -jobconf 'mapreduce.job.acl-view-job=*'
> {noformat}
> Output:
> {noformat}
> testlink!@$&*()-_+=/input1.txt testlink!@$&*()-_+=/input2.txt
> {noformat}
> The contents of input.txt are as follows:
> {noformat}
> hadoop fs -cat Streaming/streaming-1000/input.txt
> testlink!@$&*()-_+=/input1.txt
> testlink!@$&*()-_+=/input2.txt
> {noformat}
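
For readers comparing the two outputs above, the following is a rough local sketch (no Hadoop involved) of what the 'xargs cat' mapper does with input.txt: it reads the listed paths from standard input and prints the contents of each file. The temporary directory, the file contents, and the use of a plain directory in place of the symlink created by -files are all assumptions made for illustration. The last command shows, for contrast, that xargs with no command echoes the paths themselves on one line. This only illustrates the two shapes of output and does not establish what caused the 0.23 behavior.

```shell
#!/bin/sh
# Local illustration only; every path and file content here is hypothetical.
set -eu
work=$(mktemp -d)
trap 'rm -rf "$work"' EXIT
cd "$work"

# A plain directory standing in for the symlink named after '#' in -files.
link='testlink!@$&*()-_+='
mkdir "$link"
printf 'hadoop-streaming\n' > "$link/input1.txt"
printf 'streaming-is-also-part-of-it\n' > "$link/input2.txt"

# input.txt lists one path per line, as in the report.
printf '%s\n' "$link/input1.txt" "$link/input2.txt" > input.txt

# 'xargs cat' passes the listed paths to cat, which prints the file contents.
cat_out=$(xargs cat < input.txt)

# xargs with no command defaults to echo, which prints the paths themselves.
echo_out=$(xargs < input.txt)

printf 'xargs cat:\n%s\n' "$cat_out"
printf 'xargs (no command):\n%s\n' "$echo_out"
```

The special characters in the link name are not treated specially by xargs, which only interprets whitespace, quotes, and backslashes in its input.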
