Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 30012 invoked from network); 13 Jul 2010 08:28:26 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 13 Jul 2010 08:28:26 -0000 Received: (qmail 8850 invoked by uid 500); 13 Jul 2010 08:28:25 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 8586 invoked by uid 500); 13 Jul 2010 08:28:22 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 8578 invoked by uid 99); 13 Jul 2010 08:28:21 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 13 Jul 2010 08:28:21 +0000 X-ASF-Spam-Status: No, hits=0.7 required=10.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [209.85.215.44] (HELO mail-ew0-f44.google.com) (209.85.215.44) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 13 Jul 2010 08:28:13 +0000 Received: by ewy22 with SMTP id 22so1237778ewy.31 for ; Tue, 13 Jul 2010 01:27:04 -0700 (PDT) MIME-Version: 1.0 Received: by 10.103.5.20 with SMTP id h20mr269728mui.120.1279009624184; Tue, 13 Jul 2010 01:27:04 -0700 (PDT) Received: by 10.103.173.3 with HTTP; Tue, 13 Jul 2010 01:27:04 -0700 (PDT) In-Reply-To: References: Date: Tue, 13 Jul 2010 10:27:04 +0200 Message-ID: Subject: Re: CassandraBulkLoader From: Torsten Curdt To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org On Tue, Jul 13, 2010 at 04:35, Mubarak Seyed wrote: > Where can i find the documentation for BinaryMemTable (btm_example in contrib) > to use CassandraBulkLoader? What is the input to be supplied to CassandraBulkLoader? > How to form the input data and what is the format of an input data? The code is the documentation I fear. I'll see if I get permission to get our updated code contributed. We added command line fu and using it to import large TSVs. > Do i need the HDFS to store my storage-conf.xml? Why HDFS? The machine running the bulk loader joins the cassandra ring kind of like a temporary node. So you will need the storage-conf.xml on that machine. cheers -- Torsten