Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 11493 invoked from network); 22 Nov 2010 21:52:25 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 22 Nov 2010 21:52:25 -0000 Received: (qmail 23153 invoked by uid 500); 22 Nov 2010 21:52:55 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 23129 invoked by uid 500); 22 Nov 2010 21:52:55 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 23121 invoked by uid 99); 22 Nov 2010 21:52:55 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 22 Nov 2010 21:52:55 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of tlipcon@gmail.com designates 209.85.160.172 as permitted sender) Received: from [209.85.160.172] (HELO mail-gy0-f172.google.com) (209.85.160.172) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 22 Nov 2010 21:52:49 +0000 Received: by gyb13 with SMTP id 13so1905194gyb.31 for ; Mon, 22 Nov 2010 13:52:28 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:mime-version:sender:received :in-reply-to:references:from:date:x-google-sender-auth:message-id :subject:to:content-type; bh=AtaPepBL1T/7bqx0oQM5pVnXinqSe4TKDeBfWfTqSfU=; b=QqLMJ1qVmysKVxBKaNS0kCrDb55A99SdWKGzUbwDnbZLrDHHd7p69ZEiJmee0T72P+ V1RgYluhuks2yANzOmTHn+lmyiEE0dOxI1vpl3rBHvqz33iy8rWoJ66w0bIV30n41rIF un93Mp/4nNnE6053A/oGY2UeGFe35vbxmnhmU= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:from:date :x-google-sender-auth:message-id:subject:to:content-type; b=sWECVbR3+zam6FzNxH0cwSPSZb0jxDKY6ecgn6HNy4omr+9yQNgYf8W6G4ivbOZKMQ wWC6AV/A7CfNGZAPiNvB9hnYYuV3WZrEEnHZjkMTC0mX7x5HJohKNbE191qoYK6agl1U /zhEAhpFKcuzsUTHxfUfTaaFnYBUbrfHS3fVQ= Received: by 10.231.173.138 with SMTP id p10mr7273437ibz.143.1290462747732; Mon, 22 Nov 2010 13:52:27 -0800 (PST) MIME-Version: 1.0 Sender: tlipcon@gmail.com Received: by 10.231.79.132 with HTTP; Mon, 22 Nov 2010 13:52:07 -0800 (PST) In-Reply-To: References: From: Todd Lipcon Date: Mon, 22 Nov 2010 13:52:07 -0800 X-Google-Sender-Auth: WYr5S1DRt1VYIBdTPJLHQx3eMOg Message-ID: Subject: Re: cassandra vs hbase summary (was facebook messaging) To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=0014853d202eacacd90495ab45bb --0014853d202eacacd90495ab45bb Content-Type: text/plain; charset=ISO-8859-1 On Mon, Nov 22, 2010 at 1:26 PM, Edward Capriolo wrote: > For cassandra all writes must be transmitted to all replicas. > CASSANDRA-1314 does not change how writes happen. Write operations > will still effect cache (possibly evicting things if cache is full). > Reads however will prefer a single node of it's possible replicas. > This should cause better cache utilization and less duplication for > those using READ.ONE with lower read repair settings. > > It is also worth pointing out that the HBase cache is on entire hdfs > blocks, Nope, the HBase cache is on HFile blocks, which are typically 64KB. > while Cassandra can cache on keys (key cache) or a key and all > it's columns (row cache). This has some deep implications based on how > random your reads are. Even with Cassandra's normal cache duplication > having more fined grained caches, of or rows rather then blocks, could > mean that they are more efficient anyway. > --0014853d202eacacd90495ab45bb Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable On Mon, Nov 22, 2010 at 1:26 PM, Edward Capriolo <edlinuxguru@gmail.com> w= rote:
For cassandra all writes must be transmitted to all replicas.
CASSANDRA-1314 does not change how writes happen. Write operations
will still effect cache (possibly evicting things if cache is full).
Reads however will prefer a single node of it's possible replicas.
This should cause better cache utilization and less duplication for
those using READ.ONE with lower read repair settings.

It is also worth pointing out that the HBase cache is on entire hdfs
blocks,

Nope, the HBase cache is on HFile b= locks, which are typically 64KB.
=A0
while Cassandra can cache on keys (key cache) or a key and all
it's columns (row cache). This has some deep implications based on how<= br> random your reads are. Even with Cassandra's normal cache duplication having more fined grained caches, of or rows rather then blocks, could
mean that they are more efficient anyway.

--0014853d202eacacd90495ab45bb--