Reply-To: core-dev@hadoop.apache.org
Message-ID: <118905884.1225902584490.JavaMail.jira@brutus>
Date: Wed, 5 Nov 2008 08:29:44 -0800 (PST)
From: "Klaas Bosteels (JIRA)"
To: core-dev@hadoop.apache.org
Subject: [jira] Commented: (HADOOP-4304) Add Dumbo to contrib
In-Reply-To: <945254768.1222699544445.JavaMail.jira@brutus>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit

    [ https://issues.apache.org/jira/browse/HADOOP-4304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12645249#action_12645249 ]

Klaas Bosteels commented on HADOOP-4304:
----------------------------------------

For people
interested in Dumbo: I've started using GitHub again as a public code repository for now, since it seems to take a while to get this committed:

http://github.com/klbostee/dumbo

> Add Dumbo to contrib
> --------------------
>
>                 Key: HADOOP-4304
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4304
>             Project: Hadoop Core
>          Issue Type: New Feature
>            Reporter: Klaas Bosteels
>            Assignee: Klaas Bosteels
>            Priority: Minor
>         Attachments: hadoop-4304-v2.patch, hadoop-4304-v3.patch, hadoop-4304.patch
>
>
> Originally, Dumbo was a simple Python module developed at Last.fm to make writing and running Hadoop Streaming programs very easy, but now it also consists of some (up till now unreleased) helper code in Java (although it can still be used without the Java code). We propose to add Dumbo to "src/contrib" such that the Java classes get built/installed together with the rest of Hadoop, and the Python module can be installed separately at will. A tar.gz of the directory that would have to be added to "src/contrib" is available at
> http://static.last.fm/dumbo/dumbo-contrib.tar.gz
> and more info about Dumbo can be found here:
> * Basic documentation: http://github.com/klbostee/dumbo/wikis
> * Presentation at HUG (where it was first suggested to add Dumbo to contrib): http://skillsmatter.com/podcast/home/dumbo-hadoop-streaming-made-elegant-and-easy
> * Initial announcement: http://blog.last.fm/2008/05/29/python-hadoop-flying-circus-elephant
> For some of the more advanced features of Dumbo (in particular the ones for which the Java classes are needed) there is no public documentation yet, but we could easily fill that gap by moving some of the internal Last.fm documentation to the Hadoop wiki.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.