From: Jonathan Ellis
Date: Tue, 23 Feb 2010 07:31:46 -0600
Subject: Re: problem about bootstrapping when used in huge node
To: cassandra-user@incubator.apache.org

On Tue, Feb 23, 2010 at 12:33 AM, Michael Lee wrote:
> (1) A cluster cannot be enlarge(add more node into cluster) if it
> already used more than half capacity:
>
> If every node has data more than it's half capacity , the admin may not
> bootstrapping new node into cluster,
>
> because old nodes must strip data belong to new node through
> anti-compaction, the process will create a large tmp SSTable
> file (for streaming), which may large than free disk space ( of one node ).

That's right, in the worst case. On average, any node sending to a
bootstrap node will only have to anti-compact half its data. We have
https://issues.apache.org/jira/browse/CASSANDRA-579 open to allow
streaming data w/o first writing it locally.

> (1) Is cassandra designed to waste half of it's capacity?

Yes, although I might describe it as "cassandra requires up to half
its capacity as temporary space for compaction and anticompaction."
http://wiki.apache.org/cassandra/MemtableSSTable

That's the price you pay for no random writes.

> (2) How to use node has 12 1TB disk??

You should use a better filesystem than ext3. :) We use xfs at rackspace.

-Jonathan
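
The disk-space reasoning in this reply can be checked with a little arithmetic. The sketch below is a simplified model, not Cassandra code: the function name and parameters are hypothetical, and it assumes the temporary SSTable written by anti-compaction is as large as the data being streamed, and that nothing else (such as a concurrent compaction) is competing for the free space.

```python
def anticompaction_fits(capacity_gb, used_gb, fraction_streamed):
    """Estimate whether one node has room for the anti-compaction temp file.

    fraction_streamed is the share of the node's data that belongs to the
    bootstrapping node: 0.5 is the average case mentioned in the reply,
    and values near 1.0 stand in for the worst case (an assumption of
    this sketch).
    """
    temp_gb = used_gb * fraction_streamed   # temp SSTable written locally
    free_gb = capacity_gb - used_gb         # space left on the node
    return temp_gb, free_gb, temp_gb <= free_gb


# A 1000 GB node that is 60% full.
print(anticompaction_fits(1000, 600, 0.5))  # average case
print(anticompaction_fits(1000, 600, 1.0))  # worst case
```

Under these assumptions, a node that streams half its data can be up to two-thirds full before the temporary file stops fitting, while in the worst case anything above half full is already too much.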