hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sebastiano Di Paola <sebastiano.dipa...@gmail.com>
Subject Re: Need some tutorials for Mapreduce written in Python
Date Wed, 27 Aug 2014 18:28:01 GMT
Hi there,
In order to use Python to write mapreduce jobs you need to use hadoop
streaming api.
So I will suggest start searching for it.
(here's a link although is for hadoop 1.x
http://hadoop.apache.org/docs/r1.2.1/streaming.html ) but it's a starting
point.
With streaming API you can use whatever language to write map/reduce jobs
provided they will expect to read data from stdin and write data to stdout.
Streaming api will do the magic for you ;-)
Hope it helps.
Seba



On Wed, Aug 27, 2014 at 8:13 PM, Amar Singh <amarsingh125@gmail.com> wrote:

> Hi Users,
> I am new to big data world and was in process of reading some material of
> writing mapreduce using Python.
>
> Any links or pointers in that direction will be really helpful.
>

Mime
View raw message