From: "Dop Sun"
Reply-To: user@cassandra.apache.org
Subject: RE: How to initialize the Cassandra
Date: Thu, 6 May 2010 22:39:31 +0800

Is this feedback for my request on initializing data for Cassandra?
:-)

Basically, my question is whether there is a tool, like json2sstable, that can import data in a format that is easy to create by hand. I don't know what other people's experience is, but what I'm currently doing is writing a small application to write the data in.

Dop

-----Original Message-----
From: Jonathan Ellis [mailto:jbellis@gmail.com]
Sent: Thursday, May 06, 2010 10:25 PM
To: user@cassandra.apache.org
Subject: Re: How to initialize the Cassandra

The simplest way is to just use thrift batch_mutate.

If Cassandra CPU is your bottleneck then using the binary load method from StorageProxy can help (see contrib/bmt_example). If Cassandra disk or network is your bottleneck then binary load won't really help.

On Thu, May 6, 2010 at 7:51 AM, Dop Sun wrote:
> Hi,
>
> I just discovered that the JSON file exported by sstable2json contains more
> than the data itself, such as deletedAt values.
>
> I'm wondering whether there is a tool that can import some initial data.
>
> With a typical RDBMS, this is how we do it:
>
> 1) Define the schema
> 2) Initialize the database with initial data using Insert statements
> 3) Run the application
>
> Following the same steps in Cassandra:
>
> 1) Define the schema: this is done by defining the KeySpace
> 2) Initialize the database with initial data: this is currently not there.
>    We can import data exported from an existing installation, but not from nothing
> 3) Run the application
>
> Thanks,
> Regards,
> Dop

--
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of Riptano, the source for professional Cassandra support
http://riptano.com
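
For anyone following the batch_mutate suggestion with a hand-written seed file, here is a minimal sketch of the "small application" approach. It only covers the preparation step: reading an easy-to-edit JSON file and grouping it by row key and then by column family, which, as far as I understand the Thrift interface of this era, is the shape batch_mutate expects. The seed file layout ("key", "column_family", "columns") and the function names build_mutation_map and chunked are assumptions made up for illustration, plain dicts stand in for the Thrift Mutation structures, and the actual client call is left out because it depends on the generated Thrift bindings and the Cassandra version in use.

```python
import json
import time
from collections import defaultdict


def build_mutation_map(seed_rows, timestamp=None):
    """Group seed rows as row key -> column family -> list of columns.

    Hypothetical helper: the plain dicts here are placeholders for the
    Thrift Mutation/Column structures of a real client.
    """
    if timestamp is None:
        # Microseconds since the epoch, a common client-side convention.
        timestamp = int(time.time() * 1_000_000)
    grouped = defaultdict(lambda: defaultdict(list))
    for row in seed_rows:
        # Sort column names so the output is deterministic.
        for name, value in sorted(row["columns"].items()):
            grouped[row["key"]][row["column_family"]].append(
                {"name": name, "value": value, "timestamp": timestamp}
            )
    # Convert nested defaultdicts to plain dicts.
    return {key: dict(cfs) for key, cfs in grouped.items()}


def chunked(mutation_map, batch_size):
    """Yield sub-maps holding at most batch_size row keys each."""
    keys = sorted(mutation_map)
    for start in range(0, len(keys), batch_size):
        yield {k: mutation_map[k] for k in keys[start:start + batch_size]}


# Hand-written seed data (assumed layout, not a Cassandra format).
SEED_JSON = """
[
  {"key": "user1", "column_family": "Users",
   "columns": {"name": "Alice", "email": "alice@example.com"}},
  {"key": "user2", "column_family": "Users", "columns": {"name": "Bob"}},
  {"key": "user3", "column_family": "Users", "columns": {"name": "Carol"}}
]
"""

seed_rows = json.loads(SEED_JSON)
mutation_map = build_mutation_map(seed_rows, timestamp=1)
batches = list(chunked(mutation_map, batch_size=2))

for number, batch in enumerate(batches, start=1):
    print("batch", number, "row keys:", sorted(batch))
```

Each batch would then be converted to real Mutation objects and sent in one batch_mutate call. Unlike sstable2json output, a seed file like this carries only the data itself, with no deletedAt or other internal fields to fill in by hand.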