hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Arun C Murthy <ar...@yahoo-inc.com>
Subject Re: Hadoop MapReduce + MySQL
Date Mon, 07 Jan 2008 03:49:34 GMT
On Sun, Jan 06, 2008 at 04:08:33PM +0100, Fredrik Hedberg wrote:
>In order to simplify some data crunching for a client, I threw
>together some code that allows you to run MapReduce jobs over data in
>a MySQL table.
>The code is heavily inspired by the MapReduce layer for HBase and
>works much like it. However, it's mainly meant to be used for
>development, as in it's current form, but could potentially be of use
>for people that must keep their data in a relational database and
>cannot migrate to HBase for some reason (without all the benefits of
>HBase of course).
>Needless to say, the code is a hack and has a lot of issues. Code is here [1].
>If people find it useful, I can clean it up somewhat and put it in JIRA.

Sure. The best bet is to propose a jira and let your consumers get a shot at it. I'd think
you might get more interesting requirements too. Feel free to publicise the proposal on hadoop-user
if you feel the need to get more eye-balls than on hadoop-dev. Oh, and some documentation
would help! *smile*

Doug - should we put up these in mapred.lib? Come to think of it, I'd say we could move mapred.lib
to contrib and let users go wild with their own mappers/reducers/{input|output}formats etc.;
and encourage them to contribute back. This could help build a nice eco-system around map-reduce,
while offering lesser guarantees about it's feasibility/usability etc. Thoughts? If that makes
sense I'll open a jira for this.


> - Fredrik
>[1] http://www.avafan.com/~fredrik/hadoop/

View raw message