From: Maxim Potekhin
Reply-To: potekhin@bnl.gov
Organization: Brookhaven National Laboratory
Date: Wed, 07 Dec 2011 15:04:31 -0500
To: user@cassandra.apache.org
Subject: Re:
exporting data from Cassandra cluster

Hello Alexandru,

as you probably know, my group is using Amazon S3 to permanently (or semi-permanently) park the data in CSV format, which makes it portable: we can load it into anything if needed, or analyze it on its own.

Just my half of a Swiss centime :)

And, because the S3 option is not for everybody, and since you are at CERN, talk to the data people in ATLAS. 350GB seems trivial.

Regards,
Maxim

On 12/7/2011 11:17 AM, Alexandru Dan Sicoe wrote:
> Hello everyone.
> 3 node Cassandra 0.8.5 cluster. I've left the system running in
> production environment for long term testing. I've accumulated about
> 350GB of data with RF=2. The machines I used for the tests are older
> and need to be replaced. Because of this I need to export the data to
> a permanent location. How should I export the data? In order to reduce
> the storage space I want to export only the non-replicated data? I
> mean, just one copy of the data (without the replicas). Is this
> possible? How?
>
> Cheers,
> Alexandru
>