Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E620510064 for ; Tue, 2 Jul 2013 16:22:23 +0000 (UTC) Received: (qmail 58695 invoked by uid 500); 2 Jul 2013 16:22:19 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 58608 invoked by uid 500); 2 Jul 2013 16:22:18 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 58590 invoked by uid 99); 2 Jul 2013 16:22:18 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 02 Jul 2013 16:22:18 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of chloe.guszo@gmail.com designates 74.125.82.49 as permitted sender) Received: from [74.125.82.49] (HELO mail-wg0-f49.google.com) (74.125.82.49) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 02 Jul 2013 16:22:12 +0000 Received: by mail-wg0-f49.google.com with SMTP id a12so4837746wgh.28 for ; Tue, 02 Jul 2013 09:21:52 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=PWyfBdyGuHdlYG4NBJzM80KAJUFana6plwe9MBSz7Pk=; b=flXQ2aZPtlR1GNM4ubtwQAsHGmPaTjaNrFkBJmiTiL9rE+CpUVg3fhmvuRu8nSEIyB BQAIIVtxmsdDYvNCb1/UzjEiIxyMPT4XygY7h5LB8j8/EusAYy4GUpd+lpD5IRlZtLN8 Id6BGMHqSx5Qe/I7CCZXL2CGZtRXtBADx/JtHLSvmYkzgBvE0c4YHD97IpyjVCaJpwtR 4tFKeGHwR11Zubr7DyHE5ZGPUYgza+IyiAQI71l2bFnQ/8imobUZNh2FtC2G3J4Ci7p0 waNCf4vSy9XdyRpZh+fkZBy34uZnuCw6mlTEUTjmp6G6UXMbUWsOhFrdlhOxGo1SlQ1w +b8Q== MIME-Version: 1.0 X-Received: by 10.180.20.228 with SMTP id q4mr15957013wie.60.1372782111967; Tue, 02 Jul 2013 09:21:51 -0700 (PDT) Received: by 10.194.16.170 with HTTP; Tue, 2 Jul 2013 09:21:51 -0700 (PDT) Date: Tue, 2 Jul 2013 12:21:51 -0400 Message-ID: Subject: Custom JoinRecordReader class From: Chloe Guszo To: user@hadoop.apache.org Content-Type: multipart/alternative; boundary=f46d0421848523536104e089be27 X-Virus-Checked: Checked by ClamAV on apache.org --f46d0421848523536104e089be27 Content-Type: text/plain; charset=ISO-8859-1 Hi all, I would like some help/direction on implementing a custom join class. I believe this is the way to address my task at hand, which is given 2 matrices in SequenceFile format, I wish to run operations on all pairs of rows between them. The rows may not be equal in number. The actual operations will be taken care of in Mahout. I wrote a custom class working off of InnerJoinRecordReader and OuterJoinRecordReader but they of course always get fed and thus return pairs of keys that match. How can I get a return of all key pairs? Or does this go completely against the hadoop map-reduce framework? Thanks in advance for any input. --f46d0421848523536104e089be27 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Hi all,

I would like some help/directio= n on implementing a custom join class. I believe this is the way to address= my task at hand, which is given 2 matrices in SequenceFile format, I wish = to run operations on all pairs of rows between them. The rows may not be eq= ual in number. The actual operations will be taken care of in Mahout.=A0

I wrote a custom class working off of InnerJoinRecordRe= ader and OuterJoinRecordReader but they of course always get fed and thus r= eturn pairs of keys that match. How can I get a return of all key pairs? Or= does this go completely against the hadoop map-reduce framework?

Thanks in advance for any input.
--f46d0421848523536104e089be27--