Return-Path: X-Original-To: apmail-cassandra-commits-archive@www.apache.org Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B81BC9281 for ; Mon, 28 Nov 2011 18:54:03 +0000 (UTC) Received: (qmail 23495 invoked by uid 500); 28 Nov 2011 18:54:03 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 23473 invoked by uid 500); 28 Nov 2011 18:54:03 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 23465 invoked by uid 99); 28 Nov 2011 18:54:03 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 28 Nov 2011 18:54:03 +0000 X-ASF-Spam-Status: No, hits=-2001.2 required=5.0 tests=ALL_TRUSTED,RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 28 Nov 2011 18:54:01 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 7CE8CA4FE1 for ; Mon, 28 Nov 2011 18:53:40 +0000 (UTC) Date: Mon, 28 Nov 2011 18:53:40 +0000 (UTC) From: "Brandon Williams (Commented) (JIRA)" To: commits@cassandra.apache.org Message-ID: <1541342986.18866.1322506420512.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <61025347.43761.1313545287173.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (CASSANDRA-3045) Update ColumnFamilyOutputFormat to use new bulkload API MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/CASSANDRA-3045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13158631#comment-13158631 ] Brandon Williams commented on CASSANDRA-3045: --------------------------------------------- bq. How do you configure BOF vs CFOF? By calling setOutputFormatClass on the job. bq. Why do we need to keep CFOF around? I can think of two reasons: firstly, by removing it, we break every existing job. This is pretty easy for users to fix though, as indicated above. Secondly, someone might want access to each individual record as soon as possible, rather than waiting for the entire job to finish and stream a bunch of sstables. It's a latency vs throughput tradeoff. > Update ColumnFamilyOutputFormat to use new bulkload API > ------------------------------------------------------- > > Key: CASSANDRA-3045 > URL: https://issues.apache.org/jira/browse/CASSANDRA-3045 > Project: Cassandra > Issue Type: Improvement > Components: Hadoop > Reporter: Jonathan Ellis > Assignee: Brandon Williams > Priority: Minor > Fix For: 1.1 > > Attachments: 0001-Remove-gossip-SS-requirement-from-BulkLoader.txt, 0002-Allow-DD-loading-without-yaml.txt, 0003-hadoop-output-support-for-bulk-loading.txt > > > The bulk loading interface added in CASSANDRA-1278 is a great fit for Hadoop jobs. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira