cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Scott Fines <>
Subject RE: MapReduce with two ethernet cards
Date Thu, 13 Oct 2011 18:17:23 GMT
When I look at the source for ColumnFamilyInputFormat, it appears that it does a call to client.describe_ring;
when you do the equivalent call  with nodetool, you get the 10.1.1.* addresses.  This seems
to indicate to me that I should open up the firewall and attempt to contact those IPs instead
of the normal thrift IPs. 

That leads me to think that I need to have thrift listening on both IPs, though. Would that
then be the case?

From: Scott Fines []
Sent: Thursday, October 13, 2011 12:40 PM
Subject: RE: MapReduce with two ethernet cards

The listen address on all machines are set to the 10.1.1.* addresses, while the thrift rpc
address is the 172.28.* addresses

From: Brandon Williams []
Sent: Thursday, October 13, 2011 12:28 PM
Subject: Re: MapReduce with two ethernet cards

What is your rpc_address set to?  If it's (bind everything)
then that's not going to work if listen_address is blocked.


On Thu, Oct 13, 2011 at 11:13 AM, Scott Fines <> wrote:
> I upgraded to cassandra 0.8.7, and the problem persists.
> Scott
> ________________________________________
> From: Brandon Williams []
> Sent: Monday, October 10, 2011 12:28 PM
> To:
> Subject: Re: MapReduce with two ethernet cards
> On Mon, Oct 10, 2011 at 11:47 AM, Scott Fines <> wrote:
>> Hi all,
>> This may be a silly question, but I'm at a bit of a loss, and was hoping for
>> some help.
>> I have a Cassandra cluster set up with two NICs--one for internel
>> communication between cassandra machines (10.1.1.*), and one to respond to
>> Thrift RPC (172.28.*.*).
>> I also have a Hadoop cluster set up, which, for unrelated reasons, has to
>> remain separate from Cassandra, so I've written a little MapReduce job to
>> copy data from Cassandra to Hadoop. However, when I try to run my job, I
>> get
>> failed connecting to all endpoints
>> which is puzzling to me. It seems like the MR is attempting to connect to
>> the internal communication IPs instead of the external Thrift IPs. Since I
>> set up a firewall to block external access to the internal IPs of Cassandra,
>> this is obviously going to fail.
>> So my question is: why does Cassandra MR seem to be grabbing the
>> listen_address instead of the Thrift one. Presuming it's not a funky
>> configuration error or something on my part, is that strictly necessary? All
>> told, I'd prefer if it was connecting to the Thrift IPs, but if it can't,
>> should I open up port 7000 or port 9160 between Hadoop and Cassandra?
>> Thanks for your help,
>> Scott
> Your cassandra is old, upgrade to the latest version.
> -Brandon

View raw message