Return-Path: Delivered-To: apmail-lucene-hadoop-user-archive@locus.apache.org Received: (qmail 32065 invoked from network); 16 Aug 2007 15:58:01 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 16 Aug 2007 15:58:01 -0000 Received: (qmail 75972 invoked by uid 500); 16 Aug 2007 15:57:57 -0000 Delivered-To: apmail-lucene-hadoop-user-archive@lucene.apache.org Received: (qmail 75948 invoked by uid 500); 16 Aug 2007 15:57:56 -0000 Mailing-List: contact hadoop-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-user@lucene.apache.org Delivered-To: mailing list hadoop-user@lucene.apache.org Received: (qmail 75939 invoked by uid 99); 16 Aug 2007 15:57:56 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 16 Aug 2007 08:57:56 -0700 X-ASF-Spam-Status: No, hits=2.6 required=10.0 tests=RCVD_NUMERIC_HELO,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [209.171.54.152] (HELO cx295.800onemail.com) (209.171.54.152) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 16 Aug 2007 15:57:52 +0000 Received: from cx11.800onemail.com (cx11.800onemail.com [192.168.162.107]) by cx295.800onemail.com (8.13.1/8.13.1) with ESMTP id l7GFvGr2001384 for ; Thu, 16 Aug 2007 11:57:16 -0400 Received: from CX43.800onemail.com ([192.168.162.108]) by cx11.800onemail.com with Microsoft SMTPSVC(6.0.3790.1830); Thu, 16 Aug 2007 11:56:33 -0400 Received: from 207.115.108.202 ([207.115.108.202]) by CX43.800onemail.com ([192.168.162.108]) via Exchange Front-End Server rpc.ceryx.com ([192.168.162.106]) with Microsoft Exchange Server HTTP-DAV ; Thu, 16 Aug 2007 15:56:32 +0000 Received: from sebastien by rpc.ceryx.com; 16 Aug 2007 11:57:09 -0400 Subject: JRuby + Hadoop From: Sebastien Rainville To: hadoop-user@lucene.apache.org Content-Type: text/plain Content-Transfer-Encoding: 7bit Date: Thu, 16 Aug 2007 11:57:09 -0400 Message-Id: <1187279829.6483.63.camel@sebastien> Mime-Version: 1.0 X-Mailer: Evolution 2.10.1 X-OriginalArrivalTime: 16 Aug 2007 15:56:33.0643 (UTC) FILETIME=[04BC2BB0:01C7E01E] X-CRXEFW-Info: Please contact Ceryx for more information X-CRXEFW-Virus: Clean X-CRXEFW-From: srainville@brightspark.com X-Virus-Checked: Checked by ClamAV on apache.org Hi, I'm trying to write a map/reduce job in ruby. Does anyone have an example of the ruby code? I've seen the word count example written in python but it looks to me that it's a different scenario... where the code is actually being compiled and put in a jar first. More precisely, I'm working with nutch which means that from within my ruby code I need to access data using nutch's java classes. JRuby can access the Java classes and hadoop-streaming in supposed to be able to execute map/reduce code written in any language... what I don't understand is what goes in between (especially how the ruby code would receive the arguments... for the output I guess it's just a matter of calling output.collect() like usual). thx, Sebastien p.s. an example in another language is also welcomed... I can translate it.