Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 40021 invoked from network); 20 Mar 2010 18:20:26 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 20 Mar 2010 18:20:26 -0000 Received: (qmail 61254 invoked by uid 500); 20 Mar 2010 15:30:24 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 61225 invoked by uid 500); 20 Mar 2010 15:30:24 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 61217 invoked by uid 99); 20 Mar 2010 15:30:24 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 20 Mar 2010 15:30:24 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=10.0 tests=AWL,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [64.191.203.36] (HELO mail.digg.com) (64.191.203.36) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 20 Mar 2010 15:30:18 +0000 Received: from localhost (localhost.localdomain [127.0.0.1]) by mail.digg.com (Postfix) with ESMTP id 7D7FFA84F84 for ; Sat, 20 Mar 2010 08:29:57 -0700 (PDT) X-Virus-Scanned: amavisd-new at d242.digg.internal Received: from mail.digg.com ([127.0.0.1]) by localhost (mail.digg.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id ODJUsHd5RmXP for ; Sat, 20 Mar 2010 08:29:51 -0700 (PDT) Received: from [10.2.16.222] (diggstage01.digg.com [64.191.203.34]) by mail.digg.com (Postfix) with ESMTP id 8C73BA84FAD for ; Sat, 20 Mar 2010 08:29:51 -0700 (PDT) From: Chris Goffinet Mime-Version: 1.0 (Apple Message framework v1077) Content-Type: multipart/alternative; boundary=Apple-Mail-3-881498816 Subject: Re: Digg's data model Date: Sat, 20 Mar 2010 08:29:51 -0700 In-Reply-To: <240826b41003200153p763258edna452b8082b4c7c5f@mail.gmail.com> To: user@cassandra.apache.org References: <240826b41003200153p763258edna452b8082b4c7c5f@mail.gmail.com> Message-Id: <17409F87-E390-458B-9A43-A60276B0A522@chrisgoffinet.com> X-Mailer: Apple Mail (2.1077) --Apple-Mail-3-881498816 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=us-ascii > Also, Does cassandra support counters? Digg's article said they are = going to contribute their work to open source any idea when that would = be? >=20 All of the custom work has been pushed upstream from Digg and continues. = We have a few operational tools we will be releasing that will go into = contrib/. I would say Counters w/ vector clocks should have some jira = patches in few weeks. We have someone full-time working on that.=20 > Thanks in advance for sharing your experience >=20 > Lenin >=20 >=20 >=20 > On Fri, Mar 19, 2010 at 1:03 PM, Jonathan Ellis = wrote: > Jeff Hodsdon edited the new link in: > http://about.digg.com/blog/looking-future-cassandra >=20 > On Fri, Mar 19, 2010 at 2:49 PM, Nathan McCall = wrote: > > Gary, > > Did you see this larticle linked from the Cassandra wiki? > > http://about.digg.com/node/564 > > > > See http://wiki.apache.org/cassandra/ArticlesAndPresentations for = more > > examples like the above. In general, you structure your data = according > > to how it will be queried. This can lead to duplication, but that is > > one of the trade-offs for performance and scale. > > > > Digg folks - the "Looking to the Future with Cassandra" linked on = the > > wiki is no longer available. I found that article quite helpful > > originally. Is there a chance this could be re-posted? > > > > Cheers, > > -Nate > > > > On Fri, Mar 19, 2010 at 12:16 PM, Gary wrote: > >> I am a newbie to bigtable like model and have a question as = follows. Take > >> Digg as an example, I want to find a list users who dug a URL and = also want > >> to find a list of URLs a user dug. How should the data model look = like for > >> the queries to be efficient? If I use the username and the URL for = two rows, > >> when a user digs a URL, I will have to update two rows so I need a > >> transaction to keep data consistent. > >> Any thoughts? > >> Thanks, > >> Gary > > >=20 >=20 >=20 > --=20 > twitter: leningali > skype: galilenin > Cell:513.382.3371 --Apple-Mail-3-881498816 Content-Transfer-Encoding: 7bit Content-Type: text/html; charset=us-ascii
Also, Does cassandra support counters? Digg's article said they are going to contribute their work to open source any idea when that would be?


All of the custom work has been pushed upstream from Digg and continues. We have a few operational tools we will be releasing that will go into contrib/. I would say Counters w/ vector clocks should have some jira patches in few weeks. We have someone full-time working on that. 

Thanks in advance for sharing your experience

Lenin



On Fri, Mar 19, 2010 at 1:03 PM, Jonathan Ellis <jbellis@gmail.com> wrote:
Jeff Hodsdon edited the new link in:
http://about.digg.com/blog/looking-future-cassandra

On Fri, Mar 19, 2010 at 2:49 PM, Nathan McCall <nate@vervewireless.com> wrote:
> Gary,
> Did you see this larticle linked from the Cassandra wiki?
> http://about.digg.com/node/564
>
> See http://wiki.apache.org/cassandra/ArticlesAndPresentations for more
> examples like the above. In general, you structure your data according
> to how it will be queried. This can lead to duplication, but that is
> one of the trade-offs for performance and scale.
>
> Digg folks - the "Looking to the Future with Cassandra" linked on the
> wiki is no longer available. I found that article quite helpful
> originally. Is there a chance this could be re-posted?
>
> Cheers,
> -Nate
>
> On Fri, Mar 19, 2010 at 12:16 PM, Gary <daxiatou@gmail.com> wrote:
>> I am a newbie to bigtable like model and have a question as follows. Take
>> Digg as an example, I want to find a list users who dug a URL and also want
>> to find a list of URLs a user dug. How should the data model look like for
>> the queries to be efficient? If I use the username and the URL for two rows,
>> when a user digs a URL, I will have to update two rows so I need a
>> transaction to keep data consistent.
>> Any thoughts?
>> Thanks,
>> Gary
>



--
twitter: leningali
skype: galilenin
Cell:513.382.3371

--Apple-Mail-3-881498816--