hadoop-common-user mailing list archives

From Nathan Marz <nathan.m...@gmail.com>
Subject Re: cascalog and amazon emr
Date Thu, 16 Jun 2011 06:34:40 GMT
Using Cascalog from the REPL with EMR is straightforward. First, you'll want
to run a job flow in persistent mode so that it acts like a normal Hadoop
cluster. Then, upload your jobjar to the master machine. On the master
machine, run "hadoop jar {jobjar} clojure.lang.Repl" and you can run queries
from there. Just remember that any custom operations (defmapop, deffilterop,
and the other def*ops) need to be compiled into the jobjar already, since the
task nodes load them from the jar rather than from your REPL session.
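
The upload and REPL steps above can be sketched as a small shell script. This
is only a rough sketch, assuming the persistent job flow is already running and
that you reach the master over SSH as the "hadoop" user; the key path, master
hostname, and jobjar name are hypothetical placeholders, not values from this
thread. It prints the commands by default and only executes them when RUN=1.

```shell
#!/bin/sh
# Hypothetical placeholders: substitute your own key, master host, and jobjar.
KEY="${KEY:-$HOME/.ssh/emr-key.pem}"
MASTER="${MASTER:-hadoop@ec2-xx-xx-xx-xx.compute-1.amazonaws.com}"
JOBJAR="${JOBJAR:-myproject-standalone.jar}"

# Dry run by default: print each command. Set RUN=1 to execute it instead.
run() {
  if [ "${RUN:-0}" = "1" ]; then
    "$@"
  else
    printf '%s\n' "$*"
  fi
}

# 1. Upload the jobjar to the home directory on the master machine.
run scp -i "$KEY" "$JOBJAR" "$MASTER:"

# 2. Start a Clojure REPL on the master with the jobjar on Hadoop's classpath.
#    -t allocates a terminal so the REPL is interactive.
run ssh -t -i "$KEY" "$MASTER" "hadoop jar $(basename "$JOBJAR") clojure.lang.Repl"
```

Once the REPL is up, load your namespaces from the jobjar and run queries as
you would locally.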

-Nathan


-- 
Twitter: @nathanmarz
http://nathanmarz.com


On Wed, Jun 15, 2011 at 2:21 PM, hiroprotagonist <axue@lumoslabs.com> wrote:

> hey,
>
> been playing around with cascalog recently and was just wondering if anyone
> had any advice on how to set it up to work with EMR?
>
> is there some way to have the interactive REPL interface set up so that you
> can play around the way you can with the interactive HIVE CLI?
>
> thanks!
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/cascalog-and-amazon-emr-tp3069655p3069655.html
> Sent from the Hadoop lucene-users mailing list archive at Nabble.com.
>
