Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 4910D49C1 for ; Thu, 7 Jul 2011 02:24:35 +0000 (UTC) Received: (qmail 37307 invoked by uid 500); 7 Jul 2011 02:24:32 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 37186 invoked by uid 500); 7 Jul 2011 02:24:32 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 37178 invoked by uid 99); 7 Jul 2011 02:24:32 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 Jul 2011 02:24:31 +0000 X-ASF-Spam-Status: No, hits=0.0 required=5.0 tests=MIME_QP_LONG_LINE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [208.113.200.5] (HELO homiemail-a41.g.dreamhost.com) (208.113.200.5) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 Jul 2011 02:24:25 +0000 Received: from homiemail-a41.g.dreamhost.com (localhost [127.0.0.1]) by homiemail-a41.g.dreamhost.com (Postfix) with ESMTP id A4C0D44C05D for ; Wed, 6 Jul 2011 19:24:04 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; c=nofws; d=thelastpickle.com; h=subject :references:from:content-type:in-reply-to:message-id:date:to :content-transfer-encoding:mime-version; q=dns; s= thelastpickle.com; b=WZr5AVAjxycC7gUy9kFAkcodLfwAXyPBiR5lc+nOpZ/ thDeq4SMVivyxRQU3YK8Vt1IEZh9dvWRx4wXDdEK4mtXLGWwVnemzgHr+H/eGAcR Npt3RudkDNWxz8J+7QJWTaPEPuB6SZpWUgQFxJZiJuPd6CH/p6Dhk08bY7pRhrUA = DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=thelastpickle.com; h= subject:references:from:content-type:in-reply-to:message-id:date :to:content-transfer-encoding:mime-version; s=thelastpickle.com; bh=UalDzkhp6S0tcCfa90U0zj80ZLI=; b=Wqfz1LpglUybiwWSRJVRm0hJs78i iPSbAyH2LcJn6L4Uin9JrkYCq64qYL3wikqERpCEXb3qzwiPq8v8An42K904wmvF l2UiyEnV5RmAC0pSemk/SK6IFqeqPr8j3g2/1+AimqwavlrkTOWh3EvyQ4Y0ASoJ eUtpVkNqtZjqYg8= Received: from [115.189.55.21] (unknown [115.189.55.21]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) (Authenticated sender: aaron@thelastpickle.com) by homiemail-a41.g.dreamhost.com (Postfix) with ESMTPSA id 9084B44C055 for ; Wed, 6 Jul 2011 19:24:03 -0700 (PDT) Subject: Re: Running hadoop jobs against data in remote data center References: From: Aaron Morton Content-Type: text/plain; charset=us-ascii X-Mailer: iPad Mail (8J3) In-Reply-To: Message-Id: Date: Thu, 7 Jul 2011 14:23:54 +1200 To: "user@cassandra.apache.org" Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (iPad Mail 8J3) X-Virus-Checked: Checked by ClamAV on apache.org See http://www.datastax.com/dev/blog/deploying-cassandra-across-multiple-dat= a-centers and http://www.datastax.com/docs/0.8/brisk/about_brisk#about-the-b= risk-architecture It's possible to run multi DC and use LOCAL_QUORUM consistency level in your= production centre to allow the prod code to get on with it's life without w= orrying about the other DC. Hope that helps. ----------------- Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 7/07/2011, at 1:29 PM, Jason Baker wrote: > I'm just setting up a Cassandra cluster for my company. For a variety of r= easons, we have the servers that run our hadoop jobs in our local office and= our production machines in a collocated data center. We don't want to run h= adoop jobs against cassandra servers on the other side of the US from us, no= t to mention that we don't want them impacting performance in production. W= hat's the best way to handle this? >=20 > My first instinct is to add some servers locally to the node and use Netwo= rkTopologyStrategy. This way, the servers automatically get updated with th= e latest changes, and we get a bit of extra redundancy for our production ma= chine. Of course, the glaring weakness of this strategy is that our stats s= ervers aren't in a datacenter with any kind of production guarantees. The n= etwork connection is relatively slow and unreliable, the servers may go out a= t any time, and I generally don't want to tie our production performance or r= eliability to these servers. >=20 > Is this as dumb an idea as I suspect it is, or can this be made to work? := -) >=20 > Are there any better ways to accomplish what I'm trying to accomplish?