Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 61EF2D2D5 for ; Wed, 14 Nov 2012 17:42:58 +0000 (UTC) Received: (qmail 57585 invoked by uid 500); 14 Nov 2012 17:42:53 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 57456 invoked by uid 500); 14 Nov 2012 17:42:53 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 57448 invoked by uid 99); 14 Nov 2012 17:42:53 -0000 Received: from minotaur.apache.org (HELO minotaur.apache.org) (140.211.11.9) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 14 Nov 2012 17:42:53 +0000 Received: from localhost (HELO mail-ob0-f176.google.com) (127.0.0.1) (smtp-auth username cdouglas, mechanism plain) by minotaur.apache.org (qpsmtpd/0.29) with ESMTP; Wed, 14 Nov 2012 17:42:53 +0000 Received: by mail-ob0-f176.google.com with SMTP id un3so787781obb.35 for ; Wed, 14 Nov 2012 09:42:52 -0800 (PST) MIME-Version: 1.0 Received: by 10.60.4.136 with SMTP id k8mr21013573oek.94.1352914972186; Wed, 14 Nov 2012 09:42:52 -0800 (PST) Received: by 10.182.97.164 with HTTP; Wed, 14 Nov 2012 09:42:52 -0800 (PST) In-Reply-To: References: Date: Wed, 14 Nov 2012 09:42:52 -0800 Message-ID: Subject: Re: Hadoop map-side join From: Chris Douglas To: user@hadoop.apache.org Cc: Peter Sheridan , Jim Brooks Content-Type: text/plain; charset=ISO-8859-1 See https://issues.apache.org/jira/browse/MAPREDUCE-355 (not in 1.x series) -C On Tue, Nov 13, 2012 at 8:26 AM, Guang Yang wrote: > Hi, > > I'm trying to use Hadoop map-side join in my application and wondering if > anybody knows if there's a way to use it with the new Hadoop API > ("org.apache.hadoop.mapreduce.*") instead of the old Hadoop API > ("org.apache.hadoop.mapred.*"). The input format I'm trying to use for the > join is "CompositeInputFormat", which is in the old API package and looks > like it expects everything (job configuration, input split, etc) to be from > the old API too. This is a problem for me because I'm using the new API to > create my map/reduce jobs so I can't just use "CompositeInputFormat" as my > job's input format. I wonder if the only way to get the map-side join work > is to use the old API to create map/reduce jobs. I appreciate any response > regarding this issue. > > Thanks, > Guang Yang