cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From aaron morton <aa...@thelastpickle.com>
Subject Re: two-node cassandra cluster
Date Sun, 26 Aug 2012 21:28:24 GMT
> most of the time and then on burst days I want to bring one more
> server up (with more RAM and CPU than the first) to help serve the
> load.
Unless you are using virtual nodes (coming in 1.2) and I higher RF  I would recommend using
machines that have the same HW spec. Otherwise you need to capacity plan against the combined
machine with the lowest spec components from all machines. If you are scaling for throughput
you will need to keep this in mind. With 2 nodes and RF 2 both nodes have to do the same amount
of work. 

In _general_ the assumption is that cluster membership is reasonably stable. Routinely scaling
up and down will be fighting things a little. 
 
To bring the node up I would:
* start it with auto_bootstrap off. 
* copy over all the data from the first node
* run repair. 

To decommission I would:
* run repair on the always on node.
* turn off the additional node
* run nodetool removetoken on the always on node to remove the additional node. 

If you want to go down this path make sure you do a lot of testing to get the process ironed
out. I would be thinking about: 

Does the cluster have to be continuously available ? 
How much data are we talking about ? How long will it take to transfer ?
What will the network latency be like between the nodes ? Latency between new nodes can be
a lottery. 
If you are storing the data on a single node that uses RAID 0, how will you handle disk failure
?

Hope that helps. 

-----------------
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 24/08/2012, at 10:25 PM, Jason Axelson <jason@engagestage.com> wrote:

> Hi, I have an application that will be very dormant most of the time
> but will need high-bursting a few days out of the month. Since we are
> deploying on EC2 I would like to keep only one Cassandra server up
> most of the time and then on burst days I want to bring one more
> server up (with more RAM and CPU than the first) to help serve the
> load. What is the best way to do this? Should I take a different
> approach?
> 
> Some notes about what I plan to do:
> * Bring the node up and repair it immediately
> * After the burst time is over decommission the powerful node
> * Use the always-on server as the seed node
> * My main question is how to get the nodes to share all the data since
> I want a replication factor of 2 (so both nodes have all the data) but
> that won't work while there is only one server. Should I bring up 2
> extra servers instead of just one?
> 
> Thanks,
> Jason


Mime
View raw message