Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6AEC89B42 for ; Sun, 20 Nov 2011 20:33:14 +0000 (UTC) Received: (qmail 15324 invoked by uid 500); 20 Nov 2011 20:33:12 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 15291 invoked by uid 500); 20 Nov 2011 20:33:12 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 15283 invoked by uid 99); 20 Nov 2011 20:33:12 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 20 Nov 2011 20:33:12 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of stephen.alan.connolly@gmail.com designates 74.125.82.42 as permitted sender) Received: from [74.125.82.42] (HELO mail-ww0-f42.google.com) (74.125.82.42) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 20 Nov 2011 20:33:06 +0000 Received: by wwe3 with SMTP id 3so4370022wwe.1 for ; Sun, 20 Nov 2011 12:32:44 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=991iY5RIDDdxl/FsXp7ZDpBWoKh4omX/Z5hK81MvGVA=; b=RAf2qmARvWeaKLi+SMoOoecSBSmcoK0lUXmH5cFjlBFnBAIX6D23D4m3iBuRlP50ae otNLqYjp6Im68Mm0AbSiAXZtHbFMhTmgu/vBLx5DeTmjc83CIhVhgKnvPPHImIV9AU5O 9qCFhlILcWmTRKRGJvFa9+snrvwkqnPpijP7Y= MIME-Version: 1.0 Received: by 10.180.84.33 with SMTP id v1mr11395968wiy.4.1321821164756; Sun, 20 Nov 2011 12:32:44 -0800 (PST) Received: by 10.216.169.137 with HTTP; Sun, 20 Nov 2011 12:32:44 -0800 (PST) Received: by 10.216.169.137 with HTTP; Sun, 20 Nov 2011 12:32:44 -0800 (PST) In-Reply-To: References: Date: Sun, 20 Nov 2011 20:32:44 +0000 Message-ID: Subject: Re: data agility From: Stephen Connolly To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=f46d0442863efb5d6504b230784e --f46d0442863efb5d6504b230784e Content-Type: text/plain; charset=ISO-8859-1 if your startup is bootstrapping then cassandra is sometimes to heavy to start with. i.e. it needs to be fed ram... you're not going to seriously run it in less than 1gb per node... that level of ram commitment can be too much while bootstrapping. if your startup has enough cash to pay for 3-5 recommended spec (see wiki) nodes to be up 24/7 then cassandra is a good fit... a friend of mine is bootstrapping a startup and had to drop back to mysql while he finds his pain points and customers... he knows he will end up jumping back to cassandra when he gets enough customers (or a VC) but for now the running costs are too much to pay from his own pocket... note that the jdbc driver and cql will make jumping back easy for him (as he still tests with c*... just runs at present against mysql.... nuts eh!) - Stephen --- Sent from my Android phone, so random spelling mistakes, random nonsense words and other nonsense are a direct result of using swype to type on the screen On 20 Nov 2011 19:07, "Dotan N." wrote: > Hi all, > my question may be more philosophical than related technically > to Cassandra, but please bear with me. > > Given that a young startup may not know its product full at the early > stages, but that it definitely points to ~200M users, > would Cassandra will be the right way to go? > > That is, the requirement is for a large data store, that can move with > product changes and requirements swiftly. > > Given that in Cassandra one thinks hard about the queries, and then builds > a model to suit it best, I was thinking of > this situation as problematic. > > So here are some questions: > > - would it be wiser to start with a more agile data store (such as > mongodb) and then progress onto Cassandra, when the product itself > solidifies? > - given that we start with Cassandra from the get go, what is a common > (and quick in terms of development) way or practice to change data, change > schemas, as the product evolves? > - is it even smart to start with Cassandra? would only startups whose core > business is big data start with it from the get go? > - how would you do map/reduce with Cassandra? how agile is that? (for > example, can you run map/reduce _very_ frequently?) > > Thanks! > > -- > Dotan, @jondot > > --f46d0442863efb5d6504b230784e Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable

if your startup is bootstrapping then cassandra is sometimes to heavy to= start with.

i.e. it needs to be fed ram... you're not going to seriously run it = in less than 1gb per node... that level of ram commitment can be too much w= hile bootstrapping.

if your startup has enough cash to pay for 3-5 recommended spec (see wik= i) nodes to be up 24/7 then cassandra is a good fit...

a friend of mine is bootstrapping a startup and had to drop back to mysq= l while he finds his pain points and customers... he knows he will end up j= umping back to cassandra when he gets enough customers (or a VC) but for no= w the running costs are too much to pay from his own pocket... note that th= e jdbc driver and cql will make jumping back easy for him (as he still test= s with c*... just runs at present against mysql.... nuts eh!)

- Stephen

---
Sent from my Android phone, so random spelling mistakes, random nonsense wo= rds and other nonsense are a direct result of using swype to type on the sc= reen

On 20 Nov 2011 19:07, "Dotan N." <<= a href=3D"mailto:dipidi@gmail.com">dipidi@gmail.com> wrote:
Hi all,
my question may be more=A0philosophical=A0than= related technically to=A0Cassandra, but please bear with me.
Given that a young startup may not know its product full at the= early stages, but that it definitely points to ~200M users,
would Cassandra will be the right way to go?

= That is, the requirement is for a large data store, that can move with prod= uct changes and requirements swiftly.

Given that i= n Cassandra one thinks hard about the queries, and then builds a model to s= uit it best, I was thinking of
this situation as problematic.

So here are so= me questions:

- would it be wiser to start with a = more agile data store (such as mongodb) and then progress onto Cassandra, w= hen the product itself solidifies?
- given that we start with Cassandra from the get go, what is a common= (and quick in terms of development) way or practice to change data, change= schemas, as the product evolves?
- is it even smart to start wit= h Cassandra? would only startups whose core business is big data start with= it from the get go?
- how would you do map/reduce with Cassandra? how agile is that? (for = example, can you run map/reduce _very_ frequently?)

Thanks!

--
Dotan,=A0@jondot

--f46d0442863efb5d6504b230784e--