Mailing-List: contact hadoop-user-help@lucene.apache.org; run by ezmlm
Precedence: bulk
Reply-To: hadoop-user@lucene.apache.org
Received-SPF: pass (athena.apache.org: local policy)
Subject: JRuby + Hadoop
From: Sebastien Rainville <srainville@brightspark.com>
To: hadoop-user@lucene.apache.org
Content-Type: text/plain
Content-Transfer-Encoding: 7bit
Date: Thu, 16 Aug 2007 11:57:09 -0400
Message-Id: <1187279829.6483.63.camel@sebastien>
Mime-Version: 1.0

Hi,

I'm trying to write a map/reduce job in Ruby. Does anyone have an
example of the Ruby code? I've seen the word count example written in
Python, but it looks to me like a different scenario... one where the
code is actually compiled and put in a jar first.

More precisely, I'm working with Nutch, which means that from within my
Ruby code I need to access data using Nutch's Java classes. JRuby can
access the Java classes and hadoop-streaming is supposed to be able to
execute map/reduce code written in any language... what I don't
understand is what goes in between (especially how the Ruby code would
receive the arguments... for the output I guess it's just a matter of
calling output.collect() like usual).
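
For reference, a minimal sketch of what the "in between" might look like
under hadoop-streaming. It assumes the streaming contract where the
mapper is an ordinary script that receives its input records as lines on
standard input and emits each output record as a "key<TAB>value" line on
standard output, so there would be no output.collect() call. The word
count logic and the names map_line and run_mapper are purely
illustrative, and nothing here touches Nutch's Java classes.

```ruby
#!/usr/bin/env ruby
# Hypothetical word-count mapper for hadoop-streaming (illustrative only).
# Assumption: one input record per line on standard input, one output
# record per line on standard output, formatted as "key<TAB>value".

# Turn one input line into [word, count] pairs.
def map_line(line)
  line.split.map { |word| [word.downcase, 1] }
end

# Read records from `input` and write tab-separated pairs to `output`.
def run_mapper(input, output)
  input.each_line do |line|
    map_line(line).each { |key, value| output.puts "#{key}\t#{value}" }
  end
end

# A real mapper script would end with this call:
#   run_mapper($stdin, $stdout)
```

With that last call uncommented, the script could presumably be handed
to hadoop-streaming through its -mapper option. Running it under JRuby
instead of plain Ruby would be the part that gives access to the Java
classes, which is untested here.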

thx,
Sebastien

p.s. an example in another language is also welcome... I can translate
it.