Date: Fri, 24 Feb 2012 15:21:31 +0100
Subject: unidirectional communication/replication
From: Alexandru Sicoe
To: user@cassandra.apache.org

Hello everyone,

I'm battling with a constraint: I need to regularly ship timeseries data out of a Cassandra cluster that sits inside an enclosed network.

I tried selecting all the data within a certain time window, writing it to a file, and then copying the file out, but this hurts I/O performance because even a small time window (say 5 minutes) covers more than a million rows.

It would really help if I could use Cassandra itself to replicate the data outside automatically. The problem is that I am only allowed outbound traffic from the enclosed network (not inbound). Is there any way to configure the cluster, or to have 2 data centers, such that the data center (node or cluster) outside the enclosed network only receives a replica of the data, without ever needing to communicate anything back?

I appreciate the help,
Alex

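For readers following the thread, the window-based export described in the message could look roughly like the sketch below. It is only an illustration under assumptions: the row shape `(timestamp, key, value)`, the function names `rows_in_window` and `export_window`, and the CSV layout are all hypothetical and do not come from the original setup. The rows are filtered in memory purely to keep the example self-contained; in a real deployment the time-window restriction would be part of the query against the cluster. The sketch does not address the one-way replication question itself.

```python
import csv
from datetime import datetime
from typing import Iterable, Iterator, Tuple

# Hypothetical row shape: (timestamp, series key, value).
Row = Tuple[datetime, str, float]


def rows_in_window(rows: Iterable[Row], start: datetime, end: datetime) -> Iterator[Row]:
    """Yield only the rows whose timestamp falls in the half-open window [start, end)."""
    for row in rows:
        if start <= row[0] < end:
            yield row


def export_window(rows: Iterable[Row], start: datetime, end: datetime, path: str) -> int:
    """Stream the rows of one time window to a CSV file and return the row count.

    Rows are written one at a time, so the whole window never has to be
    held in memory before the file is copied out of the network.
    """
    count = 0
    with open(path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["timestamp", "key", "value"])
        for ts, key, value in rows_in_window(rows, start, end):
            writer.writerow([ts.isoformat(), key, value])
            count += 1
    return count
```

Using a half-open window means consecutive exports (for example, back-to-back 5-minute windows) do not write the same row twice.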