Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B65FCD4CB for ; Thu, 19 Jul 2012 04:54:50 +0000 (UTC) Received: (qmail 60578 invoked by uid 500); 19 Jul 2012 04:54:48 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 60539 invoked by uid 500); 19 Jul 2012 04:54:48 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 60513 invoked by uid 99); 19 Jul 2012 04:54:47 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 19 Jul 2012 04:54:47 +0000 X-ASF-Spam-Status: No, hits=0.6 required=5.0 tests=FSL_RCVD_USER,HTML_MESSAGE,RCVD_IN_DNSWL_MED,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [74.125.149.153] (HELO na3sys009aog125.obsmtp.com) (74.125.149.153) by apache.org (qpsmtpd/0.29) with SMTP; Thu, 19 Jul 2012 04:54:37 +0000 Received: from mail-vb0-f46.google.com ([209.85.212.46]) (using TLSv1) by na3sys009aob125.postini.com ([74.125.148.12]) with SMTP ID DSNKUAeS99QeHtcsH6mko/juLwNFZ9B7Ng5m@postini.com; Wed, 18 Jul 2012 21:54:17 PDT Received: by vbbff1 with SMTP id ff1so1838522vbb.5 for ; Wed, 18 Jul 2012 21:54:14 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:date:message-id:subject:from:to:cc:content-type :x-gm-message-state; bh=qGksKF+KVnEdkSiNSXbLDcZXe1Is92AodGiSIDcp03g=; b=kVSaeg4454F/u2wGEV8UTbtiSNxip8+0ZuMqCu+slkwnDWmPXfzgua7KBLQEJfEk+6 henwANVL/MhvCQNYmSqojv5EpYer8b4f4bRBnJ9Ba2fDU5sL9YRxxuAbl52b5XW0wAi2 4d/6lKaQSdhWYSnyKzdgAPEF21/o9BG7tU86gukiMtXGV6mGkgNK+i7eThBfEWh8vJan uMsB8lKksMgobg6XFIr6IsfqIyCPUrtONoFTKU2O9NSsLndF2Gw0BjfFkG+NvZqeItbX axvAs3PhoSEiHQVZDih0mWhbjZNLUJM4JOysMqiLU+bGRU1txRxvtuc9HS3pN1FbW/G8 tA9g== MIME-Version: 1.0 Received: by 10.52.65.51 with SMTP id u19mr202132vds.17.1342673654504; Wed, 18 Jul 2012 21:54:14 -0700 (PDT) Received: by 10.220.6.146 with HTTP; Wed, 18 Jul 2012 21:54:14 -0700 (PDT) Date: Thu, 19 Jul 2012 14:54:14 +1000 Message-ID: Subject: Cassandra startup times From: Ben Kaehne To: user@cassandra.apache.org Cc: Franc Carter , David Nelson , Andrew Thomson Content-Type: multipart/alternative; boundary=20cf3071d0b239dbdf04c52792d4 X-Gm-Message-State: ALoCoQkHq190FzuUBZ1Xc9ubRsPhPYkNBKd5UqexIPwCX8r0RzZLr2AMw0kN3rU9Fx+yodsoxkcR --20cf3071d0b239dbdf04c52792d4 Content-Type: text/plain; charset=ISO-8859-1 Good evening, I am interested in improving the startup time of our cassandra cluster. We have a 3 node cluster (replication factor of 3) in which our application requires quorum reads and writes to function. Each machine is well specced with 24gig of ram, 10 cores, jna enabled etc. On each server our keyspace files are so far around 90 Gb (stored on NFS although I am not seeing signs that we have much network io). This size will grow in future. Our startup time for 1 server at the moment is greater then half an hour (45 minutes to 50 minutes even) which is putting a risk factor on the resiliance of our service. I have tried version 1.09 to latest 1.12. I do not see too much system utilization while starting either. I gazed apon an article suggesting increased speed in 1.2 although when I set it up, it did not seem to be any faster at all (if not slower). I was observing what was happening during startup and I noticed (via strace), cassandra was doing lots of 8 byte reads from: /var/lib/cassandra/data/XXXXXX/YYYYYY/XXXXXXX-YYYYYYY-hc-1871-CompressionInfo.db /var/lib/cassandra/data/XXXXXX/YYYYYY/XXXXXXX-YYYYYYY-hc-1874-CompressionInfo.db Also... Is there someone I can change the 8 byte reads to something greater? 8 byte reads across NFS is terribly inefficient (and I am guessing the cause of our terribly slow startup times). Regards, -- -Ben --20cf3071d0b239dbdf04c52792d4 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Good evening,

I am interested in improving the startup t= ime of our cassandra cluster.

We have a 3 node clu= ster (replication factor of 3) in which our application requires quorum rea= ds and writes to function.

Each machine is well specced with 24gig of ram, 10 core= s, jna enabled etc.

On each server our keyspace fi= les are so far around 90 Gb (stored on NFS although I am not seeing signs t= hat we have much network io). This size will grow in future.

Our startup time for 1 server at the moment i= s greater then half an hour (45 minutes to 50 minutes even) which is puttin= g a risk factor on the resiliance of our service. I have tried version 1.09= to latest 1.12.

I do not see too much system=A0utilization=A0while star= ting either.

I gazed apon an article suggesting in= creased speed in 1.2 although when I set it up, it did not seem to be any f= aster at all (if not slower).

I was observing what was happening during startup and I= noticed (via strace), cassandra was doing lots of 8 byte reads from:
=

=A0/var/lib/cassandra/data/XXXXXX/YYYYYY/XXXXXXX-YYYYYY= Y-hc-1871-CompressionInfo.db
=A0/var/lib/cassandra/data/XXXXXX/YYYYYY/XXXXXXX-YYYYYYY-hc-1874-= CompressionInfo.db

Also... Is there someone = I can change the 8 byte reads to something greater? 8 byte reads across NFS= is terribly inefficient (and I am guessing the cause of our terribly slow = startup times).

Regards,

--
-Ben
--20cf3071d0b239dbdf04c52792d4--