Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
MIME-Version: 1.0
In-Reply-To: 
 <A8B921227BEEA7429E36512A6D1A259E17CC5AF6@MBX021-E3-NJ-2.exch021.domain.local>
References: 
 <A8B921227BEEA7429E36512A6D1A259E17CC5AF6@MBX021-E3-NJ-2.exch021.domain.local>
Date: Wed, 14 Nov 2012 09:42:52 -0800
Message-ID: 
 <CACO5Y4xf3669_9t-5BRsH75He52C4t6jO8u=pmR4NYjRk_SxaA@mail.gmail.com>
Subject: Re: Hadoop map-side join
From: Chris Douglas <cdouglas@apache.org>
To: user@hadoop.apache.org
Cc: Peter Sheridan <psheridan@millennialmedia.com>,
 Jim Brooks <jbrooks@millennialmedia.com>
Content-Type: text/plain; charset=ISO-8859-1

See https://issues.apache.org/jira/browse/MAPREDUCE-355 (not in 1.x series) -C

On Tue, Nov 13, 2012 at 8:26 AM, Guang Yang <gyang@millennialmedia.com> wrote:
> Hi,
>
> I'm trying to use Hadoop map-side join in my application and wondering if
> anybody knows if there's a way to use it with the new Hadoop API
> ("org.apache.hadoop.mapreduce.*") instead of the old Hadoop API
> ("org.apache.hadoop.mapred.*"). The input format I'm trying to use for the
> join is "CompositeInputFormat", which is in the old API package and looks
> like it expects everything (job configuration, input split, etc) to be from
> the old API too. This is a problem for me because I'm using the new API to
> create my map/reduce jobs so I can't just use "CompositeInputFormat" as my
> job's input format. I wonder if the only way to get the map-side join work
> is to use the old API to create map/reduce jobs. I appreciate any response
> regarding this issue.
>
> Thanks,
> Guang Yang