Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7A6EC9D28 for ; Fri, 23 Sep 2011 07:59:59 +0000 (UTC) Received: (qmail 51296 invoked by uid 500); 23 Sep 2011 07:59:57 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 51257 invoked by uid 500); 23 Sep 2011 07:59:57 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 51233 invoked by uid 99); 23 Sep 2011 07:59:56 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 23 Sep 2011 07:59:56 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of roshandawrani@gmail.com designates 209.85.161.44 as permitted sender) Received: from [209.85.161.44] (HELO mail-fx0-f44.google.com) (209.85.161.44) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 23 Sep 2011 07:59:51 +0000 Received: by fxd18 with SMTP id 18so3729024fxd.31 for ; Fri, 23 Sep 2011 00:59:29 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=HURj3wAENgq7Omb9FnZmPwEoN3SpPM2Cbxmod4jR3B0=; b=vlbJkU12vYJh+OpSjsXKz4JvH1F1F7Cw2WOv8n6LJFScWldxVZL+Y6n+WWx/1lyh9V eHg5RMDWIxQ6xPsegblcsqnge8a78nTXxfP1uD/twqbUeDOGUeC32aDiVyLdBiIRfU1v LFyElMP/k1TsyqT3cT3c6qymTZ84bOXMaBoT8= MIME-Version: 1.0 Received: by 10.223.35.18 with SMTP id n18mr4581557fad.105.1316764769549; Fri, 23 Sep 2011 00:59:29 -0700 (PDT) Received: by 10.152.14.65 with HTTP; Fri, 23 Sep 2011 00:59:29 -0700 (PDT) In-Reply-To: References: Date: Fri, 23 Sep 2011 13:29:29 +0530 Message-ID: Subject: Re: Performance degradation observed through embedded cassandra server - pointers needed From: Roshan Dawrani To: hector-users@googlegroups.com Cc: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=001517476a72577ec604ad973006 --001517476a72577ec604ad973006 Content-Type: text/plain; charset=ISO-8859-1 Thanks for sharing your inputs, Edward. Some comments inline below: On Thu, Sep 22, 2011 at 7:31 PM, Edward Capriolo wrote: > > >> 1) Should should try to dig in an determine why the truncate is slower. > Look for related jira issues on truncation. > I should give it a try. I thought I might get some readymade pointers from people already knowing about 0.7.2 / 0.8.5 differences on whether our approach to truncate every test has gone even worse due to some changes in that area. > Cassandra had some re-entrant code you could fork a JVM each test and use > the CassandraServiceDataCleaner. (However multiple startups could end up > causing more overhead then the truncation) > > I avoid this problem by using a different column family and or a different > keyspaces for all my unit tests in a single class. Each class bring up a new > embedded cluster and uses the data cleaner to sanitize the data directories. > So essentially I never call truncate. > In both these approaches, won't I need to re-build the schema for every test too? Certainly in the 2nd case, if I end up creating new keyspace or different column families for each test. I am not sure what I will gain there in terms of performance. I was hoping data truncation leaving schema there would be faster than that. -- Roshan Blog: http://roshandawrani.wordpress.com/ Twitter: @roshandawrani Skype: roshandawrani --001517476a72577ec604ad973006 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Thanks for sharing your inputs, Edward. Some comments inline below:

=
On Thu, Sep 22, 2011 at 7:31 PM, Edward Capriolo= <edlinuxguru= @gmail.com> wrote:

1) Should should t= ry to dig in an determine why the truncate is slower. Look for related jira= issues on truncation.=A0

I should give it a try. I thought I = might get some readymade pointers from people already knowing about 0.7.2 /= 0.8.5 differences on whether our approach to truncate every test has gone = even worse due to some changes in that area.
=A0
Cassandra had some re-en= trant code you could fork a JVM each test and use the CassandraServiceDataC= leaner. (However multiple startups could end up causing more overhead then = the truncation)

I avoid this problem by using a different column family= and or a different keyspaces for all my unit tests in a single class. Each= class bring up a new embedded cluster and uses the data cleaner to sanitiz= e the data directories. So essentially I never call truncate.

In both these approaches, won't I need= to re-build the schema for every test too? Certainly in the 2nd case, if I= end up creating new keyspace or different column families for each test. I= am not sure what I will gain there in terms of performance. I was hoping d= ata truncation leaving schema there would be faster than that.

--
Roshan
Blog: http://roshandawrani.wordpress.com/<= br>Twitter: = @roshandawrani
Skype: roshandawrani

--001517476a72577ec604ad973006--