Return-Path: Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: (qmail 20404 invoked from network); 25 Aug 2010 22:06:57 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 25 Aug 2010 22:06:57 -0000 Received: (qmail 51554 invoked by uid 500); 25 Aug 2010 22:06:57 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 51542 invoked by uid 500); 25 Aug 2010 22:06:57 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 51534 invoked by uid 99); 25 Aug 2010 22:06:57 -0000 Received: from Unknown (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Aug 2010 22:06:57 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Aug 2010 22:06:39 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o7PM6HfL012593 for ; Wed, 25 Aug 2010 22:06:18 GMT Message-ID: <30907028.570121282773977962.JavaMail.jira@thor> Date: Wed, 25 Aug 2010 18:06:17 -0400 (EDT) From: "Stu Hood (JIRA)" To: commits@cassandra.apache.org Subject: [jira] Commented: (CASSANDRA-1368) Add output support for Hadoop Streaming In-Reply-To: <26639801.199901281110656781.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/CASSANDRA-1368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12902667#action_12902667 ] Stu Hood commented on CASSANDRA-1368: ------------------------------------- > Is this likely to come up in practice or can we get rid of it? Ack... I don't think it is actually implemented in this patch yet. Without adding it, changing the Avro client API will break Hadoop Streaming clients. I should fix that before we commit. > Add output support for Hadoop Streaming > --------------------------------------- > > Key: CASSANDRA-1368 > URL: https://issues.apache.org/jira/browse/CASSANDRA-1368 > Project: Cassandra > Issue Type: New Feature > Components: Hadoop > Reporter: Stu Hood > Fix For: 0.7 beta 2 > > Attachments: 0001-Switch-to-Cloudera-s-Distribution-of-Hadoop.patch, 0002-Add-an-Avro-OutputReader-and-Resolver-for-Hadoop-Str.patch, 0003-Apply-the-deprecated-OutputFormat-interface-to-allow.patch, 0004-Add-Streaming-example-shell-scripts.patch > > > Hadoop Streaming is a framework that allows mapreduce jobs to be written in languages other than Java, by performing simple IPC on stdin/stdout. > Adding output support for Hadoop Streaming to Cassandra would mean that users could write very simple scripts in dynamic languages to load data into Cassandra. Once our Hadoop OutputFormat has stabilized a bit, we might also be able to this code to provide scalable bulk loading. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.