Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 84860 invoked from network); 20 Jan 2011 13:28:23 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 20 Jan 2011 13:28:23 -0000 Received: (qmail 37572 invoked by uid 500); 20 Jan 2011 13:28:21 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 37444 invoked by uid 500); 20 Jan 2011 13:28:15 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 37431 invoked by uid 99); 20 Jan 2011 13:28:14 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 Jan 2011 13:28:14 +0000 X-ASF-Spam-Status: No, hits=4.0 required=10.0 tests=FREEMAIL_FROM,FREEMAIL_REPLY,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of javier.canillas@gmail.com designates 209.85.216.44 as permitted sender) Received: from [209.85.216.44] (HELO mail-qw0-f44.google.com) (209.85.216.44) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 Jan 2011 13:28:06 +0000 Received: by qwi2 with SMTP id 2so630458qwi.31 for ; Thu, 20 Jan 2011 05:27:45 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:in-reply-to:references:date :message-id:subject:from:to:content-type; bh=NZ4dH7dUmrs9ceEoF1jQjnaCymoL5K/upHIh4UCdRR4=; b=rvAEST87VvIolp1MlnyHh4pyqVH0u8gGF6w2sAU8RAnFgdVpOcyozY9cKMf4GO0G2b vtfh4hxDf6KrMVHE+fE+yFjDwHkYsQP1UFjYRIdA7s5xN6AfdqK1lbB3CwvSh/ahtXqH ER1grgQnuprRCZfLMBYCToMnjUV2mQN5mgOnY= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=VW9tHgKYFn7GWzoLOUFTmM4sz4rE5JEhxtSPrqEYFK3fPI7+yn8ZTT6dNMrm8ZZExW vjBIqrK3fav5THNjInfa7fMOGvkHrSIdObiUFbCqX/5US6OlItZN4/6YOXWTrTpLFSdt 2xFZ1w4TOHUWbbJ0FXfLFK5BHWTn1g+8+Leo8= MIME-Version: 1.0 Received: by 10.229.192.149 with SMTP id dq21mr1795099qcb.57.1295530065731; Thu, 20 Jan 2011 05:27:45 -0800 (PST) Received: by 10.220.194.8 with HTTP; Thu, 20 Jan 2011 05:27:45 -0800 (PST) In-Reply-To: References: Date: Thu, 20 Jan 2011 10:27:45 -0300 Message-ID: Subject: Re: Compression in Cassandra From: Javier Canillas To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=0016363b83c05d2080049a471983 --0016363b83c05d2080049a471983 Content-Type: text/plain; charset=UTF-8 How do you calculate your 40g data? When you insert it into Cassandra, you need to convert the data into a Byte[], maybe your problem is there. On Thu, Jan 20, 2011 at 10:02 AM, akshatbakliwal@gmail.com < akshatbakliwal@gmail.com> wrote: > Hi all, > > I am experiencing a unique situation. I loaded some data onto Cassandra. > my data was about 40 GB but when loaded to Cassandra the data directory > size is almost 170GB. > > This means the **data got inflated**. > > Is it the case just with me or some else is also facing the inflation or > its the general behavior of Cassandra. > > I am using Cassandra 0.6.8. on Ubuntu 10.10 > > -- > Akshat Bakliwal > Search Information and Extraction Lab > IIIT-Hyderabad > 09963885762 > WebPage > > --0016363b83c05d2080049a471983 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable How do you calculate your 40g data? When you insert it into Cassandra, you = need to convert the data into a Byte[], maybe your problem is there.
On Thu, Jan 20, 2011 at 10:02 AM, akshatbakliwal@gmail.com <akshatbakliwal@gmail.com= > wrote:
Hi all,

I am experiencing a unique s= ituation. I loaded some data onto Cassandra.
my data was about 40 GB but= when loaded to Cassandra the data directory size is almost 170GB.

This means the **data got inflated**.

Is it the case just with me or some else is also facing the inflation o= r its the general behavior of Cassandra.

I am using Cassandra 0.6.8.= on Ubuntu 10.10

--
Akshat= Bakliwal
Search Information and Extraction Lab
IIIT-Hyderabad
09963885762
WebPage


--0016363b83c05d2080049a471983--