Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 30054 invoked from network); 25 May 2010 18:28:02 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 25 May 2010 18:28:02 -0000 Received: (qmail 9122 invoked by uid 500); 25 May 2010 18:28:01 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 9090 invoked by uid 500); 25 May 2010 18:28:01 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 9081 invoked by uid 99); 25 May 2010 18:28:01 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 25 May 2010 18:28:01 +0000 X-ASF-Spam-Status: No, hits=-0.1 required=10.0 tests=AWL,FREEMAIL_FROM,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jbellis@gmail.com designates 74.125.82.43 as permitted sender) Received: from [74.125.82.43] (HELO mail-ww0-f43.google.com) (74.125.82.43) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 25 May 2010 18:27:57 +0000 Received: by wwg30 with SMTP id 30so100737wwg.30 for ; Tue, 25 May 2010 11:27:35 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:mime-version:received:in-reply-to :references:from:date:message-id:subject:to:content-type :content-transfer-encoding; bh=3jugCHctL64ouQ4YgFfifoCXiEt8SccZ2u2rHhVUn1o=; b=TpZiiNfOFn8heCnL9h44inuPBU0Vxmkp3kPMcCxPpBoHmqds9QXlZczqmxdEZXD1JF VT1JuwFWPeK0MoNkkKlFdleg9EMKN4epF9++Lcf5NkCqp+srGGSqajIivSx47xeLJAXv D8tNXNtEJ5ZVNK5G3VJZ+q6ah+nq7SPbveNTU= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; b=F6IOFJno+4tj1eLOCAd4Fp2SaioSw0o08b8kggcseNZnzr5+CurHNrydF8nIr5U+7x 9ga6GOa+4/YJQZvlUVhalNiQuvlzoDhk4AMKZ9YMSrA2dCO6ExM0oOuUsNJFwo2AFpS3 gQGXfLPOlNjLMvyPbV3EqdFBKmlpcBhje00sU= Received: by 10.216.93.81 with SMTP id k59mr4849504wef.92.1274812055139; Tue, 25 May 2010 11:27:35 -0700 (PDT) MIME-Version: 1.0 Received: by 10.216.17.197 with HTTP; Tue, 25 May 2010 11:27:15 -0700 (PDT) In-Reply-To: References: From: Jonathan Ellis Date: Tue, 25 May 2010 13:27:15 -0500 Message-ID: Subject: Re: Why Cassandra is "space inefficient" compared to MySQL? To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable the only place we use a java serializer is for the BitSet in bloom filters. On Tue, May 25, 2010 at 12:37 PM, Chris Goffinet wrote: > My money is on the fact that the serializer is just horribly verbose. It'= s > using a basic set of the java serializer. > -Chris > > > On Tue, May 25, 2010 at 10:02 AM, Ryan King wrote: >> >> Also, timestamps for each column. >> >> -ryan >> >> On Tue, May 25, 2010 at 5:41 AM, Jonathan Ellis wrot= e: >> > That's true. =A0But fundamentally Cassandra is expected to use more >> > space than mysql for a few reasons; usually the biggest factor is that >> > Cassandra has to write out each column name in each row, since column >> > names are dynamic unlike in mysql where you declare the columns once >> > for the whole table. >> > >> > 2010/5/25 Peter Sch=FCller : >> >>> Could you please tell me why? >> >> >> >> There might be pending sstable removals on disk, which won't happen >> >> until GC or restart. If you just did a bulk insert and checked >> >> diskspace immediately afterwards, I think this is a possible >> >> explanation. >> >> >> >> (See "Write path" on >> >> http://wiki.apache.org/cassandra/ArchitectureInternals) >> >> >> >> -- >> >> / Peter Schuller aka scode >> >> >> > >> > >> > >> > -- >> > Jonathan Ellis >> > Project Chair, Apache Cassandra >> > co-founder of Riptano, the source for professional Cassandra support >> > http://riptano.com >> > > > --=20 Jonathan Ellis Project Chair, Apache Cassandra co-founder of Riptano, the source for professional Cassandra support http://riptano.com