Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 97829 invoked from network); 19 Apr 2010 13:49:58 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 19 Apr 2010 13:49:58 -0000 Received: (qmail 36602 invoked by uid 500); 19 Apr 2010 13:49:57 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 36578 invoked by uid 500); 19 Apr 2010 13:49:57 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 36570 invoked by uid 99); 19 Apr 2010 13:49:56 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 19 Apr 2010 13:49:56 +0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=FREEMAIL_FROM,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of gdusbabek@gmail.com designates 74.125.83.44 as permitted sender) Received: from [74.125.83.44] (HELO mail-gw0-f44.google.com) (74.125.83.44) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 19 Apr 2010 13:49:49 +0000 Received: by gwj20 with SMTP id 20so382326gwj.31 for ; Mon, 19 Apr 2010 06:49:28 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:reply-to:in-reply-to :references:date:received:message-id:subject:from:to:content-type :content-transfer-encoding; bh=jpXv1iybJiVkzCTcUY6oIubONA3h77m34+wWopPSRqo=; b=i1YByavQ5zoq38DBLuUHv2FxMiFBH5WjG6H8eAe/5w6TKcRQzzaGwCKVwpOTNyiSRQ 4P9n9Vt3zF5/KRrc0NmU8lmv05inRAGYEDWxSAXICFGHQ+F2ut7I1QgJT2wRaTRY1oTN OvCkVPMhb7Irrx3wl49Y7NWLoouz7HuwSlUBU= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:reply-to:in-reply-to:references:date:message-id :subject:from:to:content-type:content-transfer-encoding; b=CUj5ZMBD4E26NUTEJluCevt/nF5IsBIWzv2B4TmRydDSqHMe+MYUXaSlmAl0D0hG5b ykEXUsEw5s6rErvOVvdsKX672IhWu6OEKl9ien7fr86A/NlbEtO5+0OdzU7xfxqMsgTh EzZICc0LtH0VerbgcMjFCwZQp0StNeHAKA3xI= MIME-Version: 1.0 Received: by 10.151.102.5 with HTTP; Mon, 19 Apr 2010 06:49:27 -0700 (PDT) Reply-To: gdusbabek@gmail.com In-Reply-To: References: <4BC88D97.7050104@gmail.com> Date: Mon, 19 Apr 2010 08:49:27 -0500 Received: by 10.150.252.8 with SMTP id z8mr6401297ybh.3.1271684968130; Mon, 19 Apr 2010 06:49:28 -0700 (PDT) Message-ID: Subject: Re: Regarding Cassandra Scalability From: Gary Dusbabek To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org On Sun, Apr 18, 2010 at 11:14, dir dir wrote: > Hi Gary, > >>The main reason is that the compaction operation (removing deleted >>values) currently requires that an entire row be read into memory. > > Thank you for your explanation. But I still do not understand what do you > mean. > When you delete a column in cassandra, the data is not really deleted. Instead a flag is turned on indicating the column is no longer valid (we call it a 'tombstone'). During compaction the column family is scanned and the tombstones are truly deleted. > in my opinion, Actually the row contents must fit in available memory. > if row contents are not fit in available memory, our software will raise > exception out of memory. since it is true( "the row contents must fit in > available memory"), > then why you said that is a problem which it (Cassandra) cannot solved?? > It was not correct of me to say it is a problem that cassandra cannot solve. Memory-efficient compactions will be addressed. > You say: "compaction operation requires that entire row be read into memo= ry" > whether this is a problem of "out of memory"??=A0 When we need to perform > compaction operation?? In what situation we shall perform compaction > operation?? You will need to address the large rows yourself (consider breaking them up). You can identify these rows during compaction by setting RowWarningThresholdInMB in storage-conf.xml. When a big enough row comes along, it is logged so you can go back later and address the problem. Regards, Gary. > Thank You. > > Dir. >