Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 59DC4E4B6 for ; Fri, 18 Jan 2013 08:23:12 +0000 (UTC) Received: (qmail 23636 invoked by uid 500); 18 Jan 2013 08:23:09 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 23430 invoked by uid 500); 18 Jan 2013 08:23:09 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 22941 invoked by uid 99); 18 Jan 2013 08:23:07 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Jan 2013 08:23:07 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of roland.gude@ez.no designates 213.199.154.209 as permitted sender) Received: from [213.199.154.209] (HELO am1outboundpool.messaging.microsoft.com) (213.199.154.209) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Jan 2013 08:22:59 +0000 Received: from mail107-am1-R.bigfish.com (10.3.201.249) by AM1EHSOBE023.bigfish.com (10.3.207.145) with Microsoft SMTP Server id 14.1.225.23; Fri, 18 Jan 2013 08:22:36 +0000 Received: from mail107-am1 (localhost [127.0.0.1]) by mail107-am1-R.bigfish.com (Postfix) with ESMTP id 977FB2600EB for ; Fri, 18 Jan 2013 08:22:36 +0000 (UTC) X-Forefront-Antispam-Report: CIP:157.56.249.213;KIP:(null);UIP:(null);IPV:NLI;H:AM2PRD0710HT004.eurprd07.prod.outlook.com;RD:none;EFVD:NLI X-SpamScore: -3 X-BigFish: PS-3(zzbb2dI98dI9371Ic85fh1432Izz1ee6h1de0h1d18h1202h1e76h1d1ah1d2ahzz17326ah18c673h1954cbh8275eh8275bh8275dha1495iz2fh2a8h668h839hd25hf0ah1288h12a5h12bdh137ah1441h1504h1537h153bh15d0h162dh1631h1758h1155h) Received-SPF: pass (mail107-am1: domain of ez.no designates 157.56.249.213 as permitted sender) client-ip=157.56.249.213; envelope-from=roland.gude@ez.no; helo=AM2PRD0710HT004.eurprd07.prod.outlook.com ;.outlook.com ; Received: from mail107-am1 (localhost.localdomain [127.0.0.1]) by mail107-am1 (MessageSwitch) id 135849735441150_19428; Fri, 18 Jan 2013 08:22:34 +0000 (UTC) Received: from AM1EHSMHS002.bigfish.com (unknown [10.3.201.243]) by mail107-am1.bigfish.com (Postfix) with ESMTP id F2827460077 for ; Fri, 18 Jan 2013 08:22:33 +0000 (UTC) Received: from AM2PRD0710HT004.eurprd07.prod.outlook.com (157.56.249.213) by AM1EHSMHS002.bigfish.com (10.3.207.102) with Microsoft SMTP Server (TLS) id 14.1.225.23; Fri, 18 Jan 2013 08:22:33 +0000 Received: from AM2PRD0710MB362.eurprd07.prod.outlook.com ([169.254.10.58]) by AM2PRD0710HT004.eurprd07.prod.outlook.com ([10.255.165.39]) with mapi id 14.16.0257.004; Fri, 18 Jan 2013 08:22:33 +0000 From: Roland Gude To: "user@cassandra.apache.org" Subject: AW: Cassandra at Amazon AWS Thread-Topic: Cassandra at Amazon AWS Thread-Index: AQHN9Co+BpNkIwHUIkubmVfiDW9ZNphMdpkAgAESoICAAD0hgIAAr9wAgABKjWA= Date: Fri, 18 Jan 2013 08:22:32 +0000 Message-ID: <69989DC961D0DB4D805CA94CF4607371241FB542@AM2PRD0710MB362.eurprd07.prod.outlook.com> References: In-Reply-To: Accept-Language: de-DE, en-US Content-Language: de-DE X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [87.79.236.144] Content-Type: multipart/alternative; boundary="_000_69989DC961D0DB4D805CA94CF4607371241FB542AM2PRD0710MB362_" MIME-Version: 1.0 X-OriginatorOrg: ez.no X-Virus-Checked: Checked by ClamAV on apache.org --_000_69989DC961D0DB4D805CA94CF4607371241FB542AM2PRD0710MB362_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Priam is good for backups but it is another complex (but very good) part to= a software stack. A simple solution is to do regular snapshots (via cron) Compress them and put them into s3 On the s3 you can simply choose how many days the files are kept. This can be done with a couple of lines of shellscript. And a simple cronta= b entry Von: Marcelo Elias Del Valle [mailto:mvallebr@gmail.com] Gesendet: Freitag, 18. Januar 2013 04:53 An: user@cassandra.apache.org Betreff: Re: Cassandra at Amazon AWS Everyone, thanks a lot for the answer, they helped me a lot. 2013/1/17 Andrey Ilinykh > I'd recommend Priam. http://techblog.netflix.com/2012/02/announcing-priam.html Andrey On Thu, Jan 17, 2013 at 5:44 AM, Adam Venturella > wrote: Jared, how do you guys handle data backups for your ephemeral based cluster= ? I'm trying to move to ephemeral drives myself, and that was my last stickin= g point; asking how others in the community deal with backup in case the VM= explodes. On Wed, Jan 16, 2013 at 1:21 PM, Jared Biel > wrote: We're currently using Cassandra on EC2 at very low scale (a 2 node cluster on m1.large instances in two regions.) I don't believe that EBS is recommended for performance reasons. Also, it's proven to be very unreliable in the past (most of the big/notable AWS outages were due to EBS issues.) We've moved 99% of our instances off of EBS. As other have said, if you require more space in the future it's easy to add more nodes to the cluster. I've found this page (http://www.ec2instances.info/) very useful in determining the amount of space each instance type has. Note that by default only one ephemeral drive is attached and you must specify all ephemeral drives that you want to use at launch time. Also, you can create a RAID 0 of all local disks to provide maximum speed and space. On 16 January 2013 20:42, Marcelo Elias Del Valle > wrote: > Hello, > > I am currently using hadoop + cassandra at amazon AWS. Cassandra runs = on > EC2 and my hadoop process runs at EMR. For cassandra storage, I am using > local EC2 EBS disks. > My system is running fine for my tests, but to me it's not a good setu= p > for production. I need my system to perform well for specially for writes= on > cassandra, but the amount of data could grow really big, taking several T= b > of total storage. > My first guess was using S3 as a storage and I saw this can be done b= y > using Cloudian package, but I wouldn't like to become dependent on a > pre-package solution and I found it's kind of expensive for more than 100= Tb: > http://www.cloudian.com/pricing.html > I saw some discussion at internet about using EBS or ephemeral disks = for > storage at Amazon too. > > My question is: does someone on this list have the same problem as me= ? > What are you using as solution to Cassandra's storage when running it at > Amazon AWS? > > Any thoughts would be highly appreciatted. > > Best regards, > -- > Marcelo Elias Del Valle > http://mvalle.com - @mvallebr -- Marcelo Elias Del Valle http://mvalle.com - @mvallebr --_000_69989DC961D0DB4D805CA94CF4607371241FB542AM2PRD0710MB362_ Content-Type: text/html; charset="us-ascii" Content-Transfer-Encoding: quoted-printable

Priam is g= ood for backups but it is another complex (but very good) part to a softwar= e stack.

A simple s= olution is to do regular snapshots (via cron)

Compress t= hem and put them into s3

On the s3 = you can simply choose how many days the files are kept.

 = ;

This can b= e done with a couple of lines of shellscript. And a simple crontab entry

 = ;

Von: Marcelo E= lias Del Valle [mailto:mvallebr@gmail.com]
Gesendet: Freitag, 18. Januar 2013 04:53
An: user@cassandra.apache.org
Betreff: Re: Cassandra at Amazon AWS

 

Everyone, thanks a lot for the answer, they helped m= e a lot.

 

2013/1/17 Andrey Ilinykh <ailinykh@gmail.com>

 

On Thu, Jan 17, 2013 at 5:44 AM, Adam Venturella <= ;aventurella@gma= il.com> wrote:

Jared, how do you guys handle data backups for your = ephemeral based cluster?

 

I'm trying to move to ephemeral drives myself, and t= hat was my last sticking point; asking how others in the community deal wit= h backup in case the VM explodes.

 

 

On Wed, Jan 16, 2013 at 1:21 PM, Jared Biel <jared.biel@b= olderthinking.com> wrote:

We're currently using Cassandra on EC2 at very low s= cale (a 2 node
cluster on m1.large instances in two regions.) I don't believe that
EBS is recommended for performance reasons. Also, it's proven to be
very unreliable in the past (most of the big/notable AWS outages were
due to EBS issues.) We've moved 99% of our instances off of EBS.

As other have said, if you require more space in the future it's easy
to add more nodes to the cluster. I've found this page
(http://www.ec2= instances.info/) very useful in determining the amount
of space each instance type has. Note that by default only one
ephemeral drive is attached and you must specify all ephemeral drives
that you want to use at launch time. Also, you can create a RAID 0 of
all local disks to provide maximum speed and space.



On 16 January 2013 20:42, Marcelo Elias Del Valle <mvallebr@gmail.com> wrote:
> Hello,
>
>    I am currently using hadoop + cassandra at amazon AWS= . Cassandra runs on
> EC2 and my hadoop process runs at EMR. For cassandra storage, I am usi= ng
> local EC2 EBS disks.
>    My system is running fine for my tests, but to me it's no= t a good setup
> for production. I need my system to perform well for specially for wri= tes on
> cassandra, but the amount of data could grow really big, taking severa= l Tb
> of total storage.
>     My first guess was using S3 as a storage and I saw this = can be done by
> using Cloudian package, but I wouldn't like to become dependent on a > pre-package solution and I found it's kind of expensive for more than = 100Tb:
> htt= p://www.cloudian.com/pricing.html
>     I saw some discussion at internet about using EBS or eph= emeral disks for
> storage at Amazon too.
>
>     My question is: does someone on this list have the same = problem as me?
> What are you using as solution to Cassandra's storage when running it = at
> Amazon AWS?
>
>     Any thoughts would be highly appreciatted.
>
> Best regards,
> --
> Marcelo Elias Del Valle
> http://mvalle.com = - @mvallebr

 

 



 

--
Marcelo Elias Del Valle
http://mvalle.com = - @mvallebr

--_000_69989DC961D0DB4D805CA94CF4607371241FB542AM2PRD0710MB362_--