hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Premal Shah <premal.j.s...@gmail.com>
Subject Re: Command line args to streaming scripts
Date Wed, 10 Aug 2011 21:00:35 GMT
i tried dumbo. my input is a log file. dumbo was splitting each log line by
spaces while passing me the input. that was totally weird. i would have
expected it to just split the file by line breaks.
will try -cmdenv

thanx

On Wed, Aug 10, 2011 at 1:26 PM, Harsh J <harsh@cloudera.com> wrote:

> Perhaps you can use -cmdenv (environment variables) instead?
>
>
> http://hadoop.apache.org/common/docs/r0.20.2/streaming.html#Specifying+Additional+Configuration+Variables+for+Jobs
>
> Btw, if you are using Python, I suggest taking a look at Dumbo. Things
> are a lot more easier with it. Dumbo is at http://last.fm/dumbo
>
> On Thu, Aug 11, 2011 at 1:37 AM, Premal Shah <premal.j.shah@gmail.com>
> wrote:
> > Is it possible to pass command line arguments to streaming scripts?
> > eg: python mapper.py --match=2
> >
> > can i pass match=2 using a streaming command to mapper.py?
> >
> > --
> > Regards,
> > Premal Shah.
> >
>
>
>
> --
> Harsh J
>



-- 
Regards,
Premal Shah.

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message