Return-Path: X-Original-To: apmail-cassandra-commits-archive@www.apache.org Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1C41B461E for ; Mon, 23 May 2011 13:25:29 +0000 (UTC) Received: (qmail 53471 invoked by uid 500); 23 May 2011 13:25:29 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 53451 invoked by uid 500); 23 May 2011 13:25:28 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 53443 invoked by uid 99); 23 May 2011 13:25:28 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 23 May 2011 13:25:28 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 23 May 2011 13:25:27 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id E1FC5D97BF for ; Mon, 23 May 2011 13:24:47 +0000 (UTC) Date: Mon, 23 May 2011 13:24:47 +0000 (UTC) From: "Hudson (JIRA)" To: commits@cassandra.apache.org Message-ID: <1089994091.35926.1306157087922.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (CASSANDRA-1278) Make bulk loading into Cassandra less crappy, more pluggable MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CASSANDRA-1278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13037933#comment-13037933 ] Hudson commented on CASSANDRA-1278: ----------------------------------- Integrated in Cassandra-0.8 #124 (See [https://builds.apache.org/hudson/job/Cassandra-0.8/124/]) Add sstable bulk loading utility patch by slebresne; reviewed by jbellis for CASSANDRA-1278 slebresne : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1126477 Files : * /cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/utils/BloomFilter.java * /cassandra/branches/cassandra-0.8/CHANGES.txt * /cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageServiceMBean.java * /cassandra/branches/cassandra-0.8/bin/sstableloader * /cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/tools/BulkLoader.java * /cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/streaming/OperationType.java * /cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/io/sstable/SSTableReader.java * /cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/io/sstable/SSTableLoader.java * /cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/streaming/StreamOutSession.java * /cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/io/sstable/SSTable.java * /cassandra/branches/cassandra-0.8/src/java/org/apache/cassandra/service/StorageService.java > Make bulk loading into Cassandra less crappy, more pluggable > ------------------------------------------------------------ > > Key: CASSANDRA-1278 > URL: https://issues.apache.org/jira/browse/CASSANDRA-1278 > Project: Cassandra > Issue Type: Improvement > Components: Tools > Reporter: Jeremy Hanna > Assignee: Sylvain Lebresne > Fix For: 0.8.1 > > Attachments: 0001-Add-bulk-loader-utility-v2.patch, 0001-Add-bulk-loader-utility.patch, 1278-cassandra-0.7-v2.txt, 1278-cassandra-0.7.1.txt, 1278-cassandra-0.7.txt > > Original Estimate: 40h > Time Spent: 40h 40m > Remaining Estimate: 0h > > Currently bulk loading into Cassandra is a black art. People are either directed to just do it responsibly with thrift or a higher level client, or they have to explore the contrib/bmt example - http://wiki.apache.org/cassandra/BinaryMemtable That contrib module requires delving into the code to find out how it works and then applying it to the given problem. Using either method, the user also needs to keep in mind that overloading the cluster is possible - which will hopefully be addressed in CASSANDRA-685 > This improvement would be to create a contrib module or set of documents dealing with bulk loading. Perhaps it could include code in the Core to make it more pluggable for external clients of different types. > It is just that this is something that many that are new to Cassandra need to do - bulk load their data into Cassandra. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira