hadoop-mapreduce-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ed Kohlwey (JIRA)" <j...@apache.org>
Subject [jira] Created: (MAPREDUCE-1223) CompositeInputFormat doesn't consider all tuples when run in a local task tracker
Date Thu, 19 Nov 2009 18:55:39 GMT
CompositeInputFormat doesn't consider all tuples when run in a local task tracker
---------------------------------------------------------------------------------

                 Key: MAPREDUCE-1223
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1223
             Project: Hadoop Map/Reduce
          Issue Type: Bug
    Affects Versions: 0.20.1
         Environment: Yahoo distribution for Hadoop 0.20.1.3041192001 and Cloudera Distribution
for Hadoop 0.20.1+133
            Reporter: Ed Kohlwey


The CrossJoin class does not emit all tuples representing the cross product of values for
a given key. The issue only occurs when using the local task tracker, and not when running
the job on a cluster. 

Example
{noformat}
table 1
k1 -> a

table 2
k1 ->c
k1 ->d
{noformat}

The expected output is
{noformat}
table 1 inner join table 2
k1->ac
k1->ad
{noformat}

Instead one gets
{noformat}
table 1 inner join table 2
k1->ac
{noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message