Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 98131 invoked from network); 18 Aug 2010 09:53:03 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 18 Aug 2010 09:53:03 -0000 Received: (qmail 35740 invoked by uid 500); 18 Aug 2010 09:53:01 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 35493 invoked by uid 500); 18 Aug 2010 09:52:58 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 35484 invoked by uid 99); 18 Aug 2010 09:52:57 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 18 Aug 2010 09:52:57 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of staeff@gmail.com designates 209.85.216.44 as permitted sender) Received: from [209.85.216.44] (HELO mail-qw0-f44.google.com) (209.85.216.44) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 18 Aug 2010 09:52:50 +0000 Received: by qwc9 with SMTP id 9so327365qwc.31 for ; Wed, 18 Aug 2010 02:52:29 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:reply-to:date :message-id:subject:from:to:content-type; bh=KqqdKxRllgDUNjlGa85xeeG3EhZVV5ZlgrIQAQ/6JKo=; b=jbxB5/o0/JGVQvFTzA/G3DkFy4s8tq+lXrRYMlAQyASXWnEomPA4Rw/9s1CoTHLPf2 EA5LoryB3o3DVM1rUrax0fH4QHmu3s1Zn/mDQOahVteFptTOkIo1gukzVfQ27e0BA74g ROLotM3MVOzrsTfgxL6qak9pZL7DcnitQdeYw= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:reply-to:date:message-id:subject:from:to:content-type; b=pjDSdSO+AiBXDABZyf6OKMQfm2T0vCPDItao0MjO4XkDv+ACVzFhTn1+jJZf07QJx/ z+bBsznh3gWaz01T0fSciDl1PNsWMEe8Z7neLLNavQBU2vZrqZVH/UrKEW6hWcoLNLY7 06SF3CTCHroFbBooYuQ+hfHaEPASPeljGj6yI= MIME-Version: 1.0 Received: by 10.229.222.77 with SMTP id if13mr5850310qcb.26.1282125149272; Wed, 18 Aug 2010 02:52:29 -0700 (PDT) Received: by 10.229.247.130 with HTTP; Wed, 18 Aug 2010 02:52:29 -0700 (PDT) Reply-To: staeff@staeff.de Date: Wed, 18 Aug 2010 09:52:29 +0000 Message-ID: Subject: uncomplete RangeSlices From: Stefan Kaufmann To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=0016363104c5147ce4048e1606b8 X-Virus-Checked: Checked by ClamAV on apache.org --0016363104c5147ce4048e1606b8 Content-Type: text/plain; charset=ISO-8859-1 During my tests, I found some strange bootstrap + get Range Slice behavior. In my testing environment im creating Datasets which look like those: sampleDPUIDforTestingIssues1/00000000000 -> value = 123456789 sampleDPUIDforTestingIssues1/00000999999 -> value = 123456789 i create 1 Million of those Keys (increasing number at the end) and then I have a query-script which uses get RangeSlices (10 000 Keys per Slices) in a loop to query all keys. The script checks if all key's could be queried. When I use the Bootstrap option to add a new node, mye query script shows me a large amount of missing keys. (looks like the same key-range from one node in the ring). However, when I try to query one single key out of the range (not using slices), I can read the value without any problem. My Script says: Following errors for sampleDPUIDforTestingIssues1 collection found: 471886 - 736000 are missing. This is the ring configuration: Address Status Load Range Ring sampleDPUIDforTestingIssues1/00000736000 10.20.10.112 Up 252.67 MB qaDOmXnF48Q4sZks |<--| 10.20.10.111 Up 252.67 MB sampleDPUIDforTestingIssues1/00000471885 | | 10.20.10.110 Up 252.66 MB sampleDPUIDforTestingIssues1/00000736000 |-->| When I add new nodes, by coping the storage/Keyspace data instead of bootstrapping - everything works fine. I searched the Issues-DB, and I found those two old ones.. (I'm not sure, if they really have to do something with this) https://issues.apache.org/jira/browse/CASSANDRA-682 https://issues.apache.org/jira/browse/CASSANDRA-902 --0016363104c5147ce4048e1606b8 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable During my tests, I found some strange bootstrap + get Range Slice behavior.=

In my testing environment im creating Datasets which look like thos= e:
sampleDPUIDforTestingIssues1/00000000000
=A0-> value =3D 123456= 789
sampleDPUIDforTestingIssues1/00000999999
=A0-> value =3D 123456789
i create 1 Million of those Keys (increasing number at the end) and th= en I have a query-script which uses get RangeSlices (10 000 Keys per Slices= ) in a loop to query all keys. The script checks if all key's could be = queried.

When I use the Bootstrap option to add a new node, mye query script sho= ws me a large amount of missing keys. (looks like the same key-range from o= ne node in the ring). However, when I try to query one single key out of th= e range (not using slices), I can read the value without any problem.


My Script says:
Following errors for sampleDPUIDforTestingIssues= 1 collection found:
471886 - 736000 are missing.

This is the rin= g configuration:
Address=A0=A0=A0=A0=A0=A0 Status=A0=A0=A0=A0 Load=A0=A0= =A0=A0=A0=A0=A0=A0=A0 Range=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0= =A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 Ring
=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0= =A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 sampleDPUIDforTestingIssues1/000007= 36000=A0=A0
10.20.10.112=A0 Up=A0=A0=A0=A0=A0=A0=A0=A0 252.67 MB=A0=A0= =A0=A0 qaDOmXnF48Q4sZks=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0= =A0=A0=A0=A0=A0=A0=A0=A0 =A0 =A0 =A0 =A0 =A0=A0=A0 |<--|
10.20.10.111= =A0 Up=A0=A0=A0=A0=A0=A0=A0=A0 252.67 MB=A0=A0=A0=A0 sampleDPUIDforTestingI= ssues1/00000471885=A0=A0 |=A0=A0 |
10.20.10.110=A0 Up=A0=A0=A0=A0=A0=A0=A0=A0 252.66 MB=A0=A0=A0=A0 sampleDPUI= DforTestingIssues1/00000736000=A0=A0 |-->|


When I add new nod= es, by coping the storage/Keyspace data instead of bootstrapping - everythi= ng works fine.

I searched the Issues-DB, and I found those two old o= nes..
(I'm not sure, if they really have to do something with this)

https://issue= s.apache.org/jira/browse/CASSANDRA-682
https://issues.apache.org/jira/browse/CA= SSANDRA-902
--0016363104c5147ce4048e1606b8--