hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Zak Stone <zst...@gmail.com>
Subject Re: Using Hadoop API through python
Date Fri, 08 May 2009 04:15:31 GMT
You should consider using Dumbo to run Python jobs with Hadoop Streaming:

http://wiki.github.com/klbostee/dumbo

Dumbo is already very useful, and it is improving all the time.

Zak


On Fri, May 8, 2009 at 12:07 AM, Aditya Desai <aditya3889@gmail.com> wrote:
> Hi All,
> Is there any way that I can access the hadoop API through python. I am aware
> that hadoop streaming can be used to create a mapper and reducer in a
> different language but have not come accross any module that helps me apply
> functions to manipulate data or control as is an option in java. First of
> all is it possible to do this. If yes can you please tell me how.
>
> Thanks,
> Aditya.
>
> --
>
> George Burns <http://www.brainyquote.com/quotes/authors/g/george_burns.html>
> - "Happiness is having a large, loving, caring, close-knit family in
> another city."
>

Mime
View raw message