Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 63504 invoked from network); 27 Oct 2010 18:07:19 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 27 Oct 2010 18:07:19 -0000 Received: (qmail 23598 invoked by uid 500); 27 Oct 2010 17:40:59 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 23585 invoked by uid 500); 27 Oct 2010 17:40:59 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 23577 invoked by uid 99); 27 Oct 2010 17:40:56 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 27 Oct 2010 17:40:56 +0000 X-ASF-Spam-Status: No, hits=2.9 required=10.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [209.85.214.44] (HELO mail-bw0-f44.google.com) (209.85.214.44) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 27 Oct 2010 17:40:49 +0000 Received: by bwz12 with SMTP id 12so830562bwz.31 for ; Wed, 27 Oct 2010 10:40:26 -0700 (PDT) MIME-Version: 1.0 Received: by 10.204.69.65 with SMTP id y1mr2139879bki.23.1288201226387; Wed, 27 Oct 2010 10:40:26 -0700 (PDT) Received: by 10.103.71.37 with HTTP; Wed, 27 Oct 2010 10:40:26 -0700 (PDT) X-Originating-IP: [64.132.24.248] In-Reply-To: References: Date: Wed, 27 Oct 2010 12:40:26 -0500 Message-ID: Subject: Re: Cluster load balancing? From: Tyler Hobbs To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=001636c5a8697f86b004939cb85b --001636c5a8697f86b004939cb85b Content-Type: text/plain; charset=ISO-8859-1 With OrderPreservingPartitioner, you have to keep the ring balanced manually. This is why people frequently suggest that you use RandomPartitioner unless you absolutely have to do otherwise. With OPP, keys are *not* evenly distributed around the ring. Apparently you have lots of keys that are between ~'t' and 'x', so start bunching your tokens there. - Tyler On Wed, Oct 27, 2010 at 12:00 PM, Thibaut Britz < thibaut.britz@trendiction.com> wrote: > Hi, > > I have a little java hector test application running whcih writes and reads > data to my little cassandra cluster (7 nodes). > > The data doesn't get loadbalanced at all: > > 192.168.1.12 Up 178.32 MB > 8S6VvT7oKNcQTso3 |<--| > 192.168.1.14 Up 30.12 MB > 9tybk3nB6JCtqQU1 | ^ > 192.168.1.15 Up 11.96 MB > RZVG3NC3ksqjEmYE v | > 192.168.1.16 Up 668.7 KB > aTV6W12YxxMI31Z8 | ^ > 192.168.1.10 Up 22.86 GB > u5iaQxEfyUSwnPn1 v | > 192.168.1.13 Up 22.5 GB > vZlWeU8b6LBeAcAY | ^ > 192.168.1.11 Up 22.27 GB > xrmaUS6nnrYFSk8e |-->| > > What could be the issue? I couldn't find anything in the FAQ related to > this > > Will data (writes) always be added to the server I connect to? If so, why > will the replicas then always be stored on the same 2 other machines. > > (Tested with > org.apache.cassandra.dht.OrderPreservingPartitioner > on 0.6.5 and replication level 3) > > Thanks, > Thibaut > > > > --001636c5a8697f86b004939cb85b Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable With OrderPreservingPartitioner, you have to keep the ring balanced manuall= y.
This is why people frequently suggest that you use RandomPartitioner = unless
you absolutely have to do otherwise.=A0 With OPP, keys are *not* = evenly distributed
around the ring.

Apparently you have lots of keys that are between ~= 't' and 'x', so start bunching
your tokens there.
- Tyler

On Wed, Oct 27, 2010 at 12:00 PM= , Thibaut Britz <thibaut.britz@trendiction.com> wrote:
Hi,

I have= a little java hector test application running whcih writes and reads data = to my little cassandra cluster (7 nodes).

The data doesn't get loadbalanced at all:

192.168.1.12 Up=A0= =A0=A0=A0=A0=A0=A0=A0 178.32 MB=A0=A0=A0=A0 8S6VvT7oKNcQTso3=A0=A0=A0=A0=A0= =A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 |<--| 192.168.1.14 Up=A0=A0=A0=A0=A0=A0=A0=A0 30.12 MB=A0=A0=A0=A0=A0 9tybk3nB6JC= tqQU1=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0= =A0=A0=A0 |=A0=A0 ^
192.168.1.15 Up=A0=A0=A0=A0=A0=A0=A0=A0 11.96 MB=A0= =A0=A0=A0=A0 RZVG3NC3ksqjEmYE=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0= =A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 v=A0=A0 |
192.168.1.16 Up=A0=A0=A0=A0= =A0=A0=A0=A0 668.7 KB=A0=A0=A0=A0=A0 aTV6W12YxxMI31Z8=A0=A0=A0=A0=A0=A0=A0= =A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 |=A0=A0 ^
192.168.1.10 Up=A0=A0=A0=A0=A0=A0=A0=A0 22.86 GB=A0=A0=A0=A0=A0 u5iaQxEfyUS= wnPn1=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0= =A0=A0=A0 v=A0=A0 |
192.168.1.13 Up=A0=A0=A0=A0=A0=A0=A0=A0 22.5 GB=A0= =A0=A0=A0=A0=A0 vZlWeU8b6LBeAcAY=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0= =A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 |=A0=A0 ^
192.168.1.11 Up=A0=A0=A0= =A0=A0=A0=A0=A0 22.27 GB=A0=A0=A0=A0=A0 xrmaUS6nnrYFSk8e=A0=A0=A0=A0=A0=A0= =A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 |-->|

What could be the issue? I couldn't find anything in the FAQ relate= d to this

Will data (writes) always be added to the server I connect= to? If so, why will the replicas then always be stored on the same 2 other= machines.

(Tested with <Partitioner>org.apache.cassandra.dht.OrderPreservin= gPartitioner</Partitioner> on 0.6.5 and replication level 3)

T= hanks,
Thibaut




--001636c5a8697f86b004939cb85b--