hadoop-mapreduce-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ed Kohlwey (JIRA)" <j...@apache.org>
Subject [jira] Resolved: (MAPREDUCE-1223) CompositeInputFormat doesn't consider all tuples when run in a local task tracker
Date Fri, 20 Nov 2009 15:03:39 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-1223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Ed Kohlwey resolved MAPREDUCE-1223.
-----------------------------------

    Resolution: Invalid

After some additional testing I'm marking this as invalid. It appears that the issue was with
one of the inputs not being sorted.

> CompositeInputFormat doesn't consider all tuples when run in a local task tracker
> ---------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-1223
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1223
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 0.20.1
>         Environment: Yahoo distribution for Hadoop 0.20.1.3041192001 and Cloudera Distribution
for Hadoop 0.20.1+133
>            Reporter: Ed Kohlwey
>
> The CrossJoin class does not emit all tuples representing the cross product of values
for a given key. The issue only occurs when using the local task tracker, and not when running
the job on a cluster. 
> Example
> {noformat}
> table 1
> k1 -> a
> table 2
> k1 ->c
> k1 ->d
> {noformat}
> The expected output is
> {noformat}
> table 1 inner join table 2
> k1->ac
> k1->ad
> {noformat}
> Instead one gets
> {noformat}
> table 1 inner join table 2
> k1->ac
> {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message