Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C6E6A11D41 for ; Thu, 24 Jul 2014 10:07:54 +0000 (UTC) Received: (qmail 38430 invoked by uid 500); 24 Jul 2014 10:07:51 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 38396 invoked by uid 500); 24 Jul 2014 10:07:51 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 38386 invoked by uid 99); 24 Jul 2014 10:07:51 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 Jul 2014 10:07:51 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of al3xdm@gmail.com designates 209.85.192.49 as permitted sender) Received: from [209.85.192.49] (HELO mail-qg0-f49.google.com) (209.85.192.49) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 Jul 2014 10:07:48 +0000 Received: by mail-qg0-f49.google.com with SMTP id j107so2943335qga.8 for ; Thu, 24 Jul 2014 03:07:24 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=v2agRLolf4MwIjkZgdcdd+zQRhBdrEr/ZSI5rszNaKA=; b=AI/whwcpJp0ji9Y5A96jB/DTSyHkoQjuV/Iun4U8QY+55wkNRnm+DqyO9nQBqrW2gT pbGvOOhC2LKNHtCudfLLsRrcCQxa+GUJQH5aPj9nVP0i3Wc53/CMlRYZQ0jH5H4hAcS4 zKVM0ksSnNcIy0FiIAsHWjDv5ocMGbueuqmqPCycY32NOlXwhXUjuVhByvd6vgKHjI8V Bk7ZzF7lvOys93YPPPRp2QNxjuUOnEsmw7QN+MfA0kI/aWO7zqOW/BSEnAsHx0WO0rDp 9KffqKsiR3G8UezRIMbA/szMeLC3i8ztLtiSxzuJM0OdkWHt/6zrK8hmzNoF+lz+wkDJ TsxA== MIME-Version: 1.0 X-Received: by 10.224.88.71 with SMTP id z7mr12519871qal.94.1406196443958; Thu, 24 Jul 2014 03:07:23 -0700 (PDT) Received: by 10.96.188.98 with HTTP; Thu, 24 Jul 2014 03:07:23 -0700 (PDT) In-Reply-To: References: Date: Thu, 24 Jul 2014 11:07:23 +0100 Message-ID: Subject: Re: Cassandra on AWS suggestions for data safety From: Alex Major To: "user@cassandra.apache.org" Content-Type: multipart/alternative; boundary=001a1133da3086d79404feed9f9a X-Virus-Checked: Checked by ClamAV on apache.org --001a1133da3086d79404feed9f9a Content-Type: text/plain; charset=UTF-8 On Thu, Jul 24, 2014 at 12:12 AM, Hao Cheng wrote: > Hello, > > Based on what I've read in the archives here and on the documentation on > Datastax and the Cassandra Community, EBS volumes, even provisioned IOPS > with EBS optimized instances, are not recommended due to inconsistent > performance. This I can deal with, but I was hoping for some > recommendations from the community as far as solutions for data safety. > > I have a few ideas in mind: > > 1. Instance store for the database, then cassandra snapshots (via > nodetool), stored on an EBS provisioned IOPS volume attached to the > instance. That volume would serve to keep the DB safe in case of instance > downtime, and I would set up regular snapshotting on the EBS volume for > data safety (pushed to S3 and eventually glacier) > > 2. Instance store used as a bcache write-through cache for attached EBS > volumes. The attached volumes persist all writes and are again snapshotted > regularly. > > 3. Using a backup system, either manually via rsync or through something > like Priam, to directly push backups of the data on ephemeral storage to S3. > > From where I'm sitting, #2 seems the easiest to set up, but could > potentially cause problems if the EBS volume backing writes sees a spike in > latency, driving up write times even if read times would remain fairly > consistent. > > Do any of you all have recommendations or suggestions for a system like > this? > > Thanks in advance! > > --Bryan > We have a cluster running that uses EBS with Provisioned Iops and we get good performance off them (comparable to instance store). The reason we're moving off them is purely because EBS has been the thing that most often crashes on AWS. The AWS SSD instance types are where we're heading and I'd recommend them if you can. Also make sure to keep at least 3 replicas, things tend to fail more regularly so it'll keep you from having immediate problems. Our setup is to snapshot the instance stores and sync to S3. Not sure why you'd sync to EBS really. Priam which you mentioned makes keeping backups (snapshots) and storing them on S3 really simple - https://github.com/Netflix/Priam/wiki/Backups --001a1133da3086d79404feed9f9a Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
On T= hu, Jul 24, 2014 at 12:12 AM, Hao Cheng <bryan@critica.io> wr= ote:
Hello,

Based on what I've read in t= he archives here and on the documentation on Datastax and the Cassandra Com= munity, EBS volumes, even provisioned IOPS with EBS optimized instances, ar= e not recommended due to inconsistent performance. This I can deal with, bu= t I was hoping for some recommendations from the community as far as soluti= ons for data safety.

I have a few ideas in mind:

1.= Instance store for the database, then cassandra snapshots (via nodetool), = stored on an EBS provisioned IOPS volume attached to the instance. That vol= ume would serve to keep the DB safe in case of instance downtime, and I wou= ld set up regular snapshotting on the EBS volume for data safety (pushed to= S3 and eventually glacier)

2. Instance store used as a bcache write-through cache = for attached EBS volumes. The attached volumes persist all writes and are a= gain snapshotted regularly.

3. Using a backup syst= em, either manually via rsync or through something like Priam, to directly = push backups of the data on ephemeral storage to S3.

From where I'm sitting, #2 seems the easiest to set= up, but could potentially cause problems if the EBS volume backing writes = sees a spike in latency, driving up write times even if read times would re= main fairly consistent.

Do any of you all have recommendations or suggestions f= or a system like this?

Thanks in advance!

--Bryan

We have a cluster r= unning that uses EBS with Provisioned Iops and we get good performance off = them (comparable to instance store). The reason we're moving off them i= s purely because EBS has been the thing that most often crashes on AWS. The= AWS SSD instance types are where we're heading and I'd recommend t= hem if you can. Also make sure to keep at least 3 replicas, things tend to = fail more regularly so it'll keep you from having immediate problems.
Our setup is to snapshot the instance stores and sync to S3.= Not sure why you'd sync to EBS really. Priam which you mentioned makes= keeping backups (snapshots) and storing them on S3 really simple -=C2=A0https://github.com/= Netflix/Priam/wiki/Backups
--001a1133da3086d79404feed9f9a--