Return-Path: X-Original-To: apmail-accumulo-dev-archive@www.apache.org Delivered-To: apmail-accumulo-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 40A4C103E2 for ; Tue, 5 Nov 2013 17:16:24 +0000 (UTC) Received: (qmail 55964 invoked by uid 500); 5 Nov 2013 17:16:24 -0000 Delivered-To: apmail-accumulo-dev-archive@accumulo.apache.org Received: (qmail 55919 invoked by uid 500); 5 Nov 2013 17:16:24 -0000 Mailing-List: contact dev-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@accumulo.apache.org Delivered-To: mailing list dev@accumulo.apache.org Received: (qmail 55911 invoked by uid 99); 5 Nov 2013 17:16:23 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 05 Nov 2013 17:16:23 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of josh.elser@gmail.com designates 209.85.220.176 as permitted sender) Received: from [209.85.220.176] (HELO mail-vc0-f176.google.com) (209.85.220.176) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 05 Nov 2013 17:16:16 +0000 Received: by mail-vc0-f176.google.com with SMTP id ia6so5815776vcb.35 for ; Tue, 05 Nov 2013 09:15:55 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type:content-transfer-encoding; bh=eZrVYZIlStuZ5k+tNc0kL/bZ37jgubll0Rb/qF5pC1g=; b=ffyx89NpiEO54NTkPYEjgq2qshxYm5pOxrBoFWPtdvUqFR/vUR1DTtJRSqvLzfyevc CBXtxJglemMJ8mR2jbzzKkxoRUvHpi5smWLUxHrWEFaQg1f5MC57EllALiXVXNvqhNE+ 9I9w8Ry2MHC+BejOaxWTYdJ7LvXN/dzW/iMyUqAd17nUIPg/Kiu9wtNy1nBIrOS2Mgk6 phuk44VIYI5y2hP1KgDzZ048WaaWENNrpVJzRjbBECGjEgX9rCzZhKnDt9vWITUbgkbt CPCR0+auSAVWd6iTglghr67hYIaj+gp1wFt9B4D9vwPxMnNyIDLzKGoy0YrMKeVF0BOO NW1A== X-Received: by 10.58.136.231 with SMTP id qd7mr16276336veb.1.1383671755351; Tue, 05 Nov 2013 09:15:55 -0800 (PST) Received: from HW10447.local (pool-173-69-177-34.bltmmd.fios.verizon.net. [173.69.177.34]) by mx.google.com with ESMTPSA id c16sm33072124vdj.12.2013.11.05.09.15.54 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Tue, 05 Nov 2013 09:15:54 -0800 (PST) Message-ID: <527927CA.7030500@gmail.com> Date: Tue, 05 Nov 2013 12:15:54 -0500 From: Josh Elser User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.9; rv:24.0) Gecko/20100101 Thunderbird/24.1.0 MIME-Version: 1.0 To: dev@accumulo.apache.org Subject: Re: MultipleInputs with AccumuloInputFormat References: <52790DC1.1030708@gmail.com> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Heh, ok. I'm currently working through a bit of a prototype to see how it works. I'm not a mapred/mapreduce expert, but I *think* I have an approach that will work. Keep an eye out for a Jira -- would love feedback. On 11/5/13, 12:13 PM, Kevin Faro wrote: > I recently looked into that and came to the same realization. > > I ended up writing a new input format that did the cartesian product of two > tables. But to do that I had to store values for the left configuration > and right configuration and then copy over whichever config settings I > wanted to use for the AIF depending on which split i needed in the > RecordReader. > > It would have been awesome if I could have just used the MultipleInputs ... > > --Kevin > > > On Tue, Nov 5, 2013 at 10:24 AM, Josh Elser wrote: > >> In executing some MapReduce over Accumulo with the AccumuloInputFormat, I >> came to the realization that AIF fundamentally doesn't work with concepts >> like MultipleInputs in Hadoop (http://hadoop.apache.org/ >> docs/current/api/org/apache/hadoop/mapreduce/lib/input/MultipleInputs.html). >> Given that you can only write one set of configuration for AIF into a >> Configuration object, there's not a mechanism to support multiple. This >> appears to be the case across all versions. >> >> Is this correct? Have I overlooked something? >> >