Return-Path: Delivered-To: apmail-cassandra-dev-archive@www.apache.org Received: (qmail 605 invoked from network); 1 Oct 2010 15:12:37 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 1 Oct 2010 15:12:37 -0000 Received: (qmail 35003 invoked by uid 500); 1 Oct 2010 15:12:37 -0000 Delivered-To: apmail-cassandra-dev-archive@cassandra.apache.org Received: (qmail 34675 invoked by uid 500); 1 Oct 2010 15:12:34 -0000 Mailing-List: contact dev-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list dev@cassandra.apache.org Received: (qmail 34667 invoked by uid 99); 1 Oct 2010 15:12:33 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Oct 2010 15:12:33 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of schumi.han@gmail.com designates 209.85.214.44 as permitted sender) Received: from [209.85.214.44] (HELO mail-bw0-f44.google.com) (209.85.214.44) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Oct 2010 15:12:27 +0000 Received: by bwz9 with SMTP id 9so2962202bwz.31 for ; Fri, 01 Oct 2010 08:12:06 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=ga41+iuX3a6hkrLHm3pdo/p9+buV5lz628h/1pn1dMw=; b=iChEI1u6ya4gU7ikDx9iCQUZFPXtuvwLwM7K2jTD39nkIB9nXrxpUKzv7N/tgIBjD4 OdV3iHf9uD4OwB9pwrKcfwHtIXx4ka8tLAyNGoDMMVMYIlsr89Oc9Vr+dHHI0n3kva/J a20REcSlQHSxGHWXTfTX1yRYK4m7VGBD9n+Q0= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=rjtnXbm8qiwt8VJ4dvViGtkI4IEBzNdEgsBJIlGfcz4dR2ZvK0WtVVFn4uRQzbSwh4 TldzHjRqLFgwelkfwMxLliFnUR8XG2BA5v+dtvOWnYkdSSoj6sBDGVrzzMk4obUa998Z FyemDPreOwarpl6LutjhBYukHw61dfwjMpNvk= MIME-Version: 1.0 Received: by 10.204.112.7 with SMTP id u7mr3742403bkp.70.1285945926139; Fri, 01 Oct 2010 08:12:06 -0700 (PDT) Received: by 10.204.163.1 with HTTP; Fri, 1 Oct 2010 08:12:06 -0700 (PDT) In-Reply-To: References: <3CCCC121-BD60-4D3B-B7AA-353CEAB9C241@oskarsson.nu> <4C0416C5-6422-48D0-9055-092543C47C42@oskarsson.nu> <0073704B-BDD4-44D1-8CCD-44C9B084A3EF@gmail.com> Date: Fri, 1 Oct 2010 23:12:06 +0800 Message-ID: Subject: Re: [DISCUSSION] High-volume counters in Cassandra From: Zhu Han To: dev@cassandra.apache.org Content-Type: multipart/alternative; boundary=0016e6d566b420c0de04918f9e19 --0016e6d566b420c0de04918f9e19 Content-Type: text/plain; charset=ISO-8859-1 > They have however at least one advantage: > - your super columns are indexed, you don't have to deserialize them > entirely each time. > The size of counter super column is limited to how many replicas propagated values as the lead replica. It's size is upper bounded by the number of replicas. Even if we support hinted hand off of counters, the size of super columns just equals to the size of nodes in the cluster, as the worse case. IMHO, it's not a big deal to de-serialize them entirely, the can fit into memory very easily. Did I miss anything here? One advantage of implementing counter as a new CF type: --you can add some counter specific configuration very easily. best regards, hanzhu To sum up, I can see the following drawbacks to such encoding: > - querying SC by names is less efficient. > - it takes more disk space (but that's the cheapest resource we have > isn't it). > > I'd say these are fair compromises. > > -- > Sylvain > --0016e6d566b420c0de04918f9e19--