From: Joost Ouwerkerk
To: user@cassandra.apache.org
Date: Sun, 18 Apr 2010 16:06:59 -0400
Subject: Help with MapReduce

I'm a Cassandra noob trying to validate Cassandra as a viable alternative to HBase (which we've been using for over a year) for our application. So far, I've had no success getting Cassandra working with MapReduce.

My first step is inserting data into Cassandra. I've created a MapReduce job based on the fat client API. I'm using the fat client (StorageProxy) because that's what ColumnFamilyInputFormat uses and I want to use the same API for both read and write jobs.

When I call StorageProxy.mutate(), nothing happens. The job completes as if it had done something, but in fact nothing has changed in the cluster. When I call StorageProxy.mutateBlocking(), I get an IOException complaining that there is no connection to the cluster. I've concluded with the debugger that StorageService is not connecting to the cluster, even though I've specified the correct seed and ListenAddress (I'm using the exact same storage-conf.xml as the nodes in the cluster).

I'm sure I'm missing something obvious in the configuration or my setup, but since I'm new to Cassandra, I can't see what it is.
Any help appreciated,
Joost