Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 06ABFC800 for ; Sat, 6 Jul 2013 20:51:14 +0000 (UTC) Received: (qmail 80561 invoked by uid 500); 6 Jul 2013 20:51:11 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 80495 invoked by uid 500); 6 Jul 2013 20:51:11 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 80487 invoked by uid 99); 6 Jul 2013 20:51:11 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 06 Jul 2013 20:51:11 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of rodrigofelixdealmeida@gmail.com designates 209.85.220.47 as permitted sender) Received: from [209.85.220.47] (HELO mail-pa0-f47.google.com) (209.85.220.47) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 06 Jul 2013 20:51:05 +0000 Received: by mail-pa0-f47.google.com with SMTP id kl14so3168837pab.34 for ; Sat, 06 Jul 2013 13:50:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:from:date:message-id:subject:to:content-type; bh=0tsqnkpNGfPLtWhwpzPopHaIWMh+o266jKw9SB4vPt0=; b=leC1Kwx0uEg3pqcE97Rrgs9AWlhIsc1U2CTo643QvSG0csk5IQ1mXbZwcUePnWUCrz CWtW1+s8xip9mPNKRDwUrw6zMrcIjIxwdPbt/eFuyh3S1HvXZSxQG33LXvYROASnJ38n IgFlI01kYNZan3ujMPpplHMBLZ0BCOKb3hbRP+z08CFSBa/bKxe0GpPFpPijpYnc+c/v 3ltXuecTXECXhig72LmhzTt9ec2Qt5Pw4Y61+QX/saGnm/MqlX7qxiyQ/CsMTZxwzPxx F4RpBXxaTW/UlmYbYVuijhy0NRsN7x2UbuKHgYQyBY2OoJPQ+0gY4uLT67AtK9rXR3J3 KXxg== X-Received: by 10.66.194.13 with SMTP id hs13mr16781149pac.152.1373143845556; Sat, 06 Jul 2013 13:50:45 -0700 (PDT) MIME-Version: 1.0 Received: by 10.66.172.172 with HTTP; Sat, 6 Jul 2013 13:50:25 -0700 (PDT) From: Rodrigo Felix Date: Sat, 6 Jul 2013 17:50:25 -0300 Message-ID: Subject: General doubts about bootstrap To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=047d7bf1641023d6f804e0ddf708 X-Virus-Checked: Checked by ClamAV on apache.org --047d7bf1641023d6f804e0ddf708 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hi, I'm facing some problems and if you could help on some of them I'd thank you. *Environment:* 2 seeds and 2 other nodes, all installed on m1.large EC2 instances. Each seed starts with about 1.7GB of data. Default cassandra configuration. - Is it normal to take about 9 minutes to add a new node? Follows the log generated by a script to add a new node. [06/07/2013 20:07:53] Remove all data stored in the Cassandra node [06/07/2013 20:07:54] [OK] All data successfully removed [06/07/2013 20:07:54] Setting seeds on cassandra.yml [06/07/2013 20:07:54] [OK] seeds successfully set [06/07/2013 20:07:54] Setting listen_address on cassandra.yml [06/07/2013 20:07:54] [OK] listen_address successfully set [06/07/2013 20:07:54] Setting initial_token on cassandra.yml [06/07/2013 20:07:54] [OK] initial_token successfully set *[06/07/2013 20:07:54] Starting cassandra...* *[06/07/2013 20:16:36] [OK] Cassandra started* [06/07/2013 20:16:37] Changing token of i-5cfc082f [06/07/2013 20:18:00] [OK] Token of i-5cfc082f successfully set to 56713727820156410577229101238628035242 [06/07/2013 20:18:00] Cleaning up i-5cfc082f [06/07/2013 20:20:13] Clean up of i-5cfc082f successfully finished [06/07/2013 20:20:13] Machine added - Is there a way to reduce the time to start cassandra? - Sometimes cleanup operation takes make minutes (about 10). Is this normal since the amount of data is small (1.7gb at maximum / seed)? - Considering that I have two seeds in the beginning, their tokens are 0 and 85070591730234615865843651857942052864. When I add a new machine, do= I need to execute move and cleanup on both seeds? Nowadays, I'm running cleanup on seed 0, move + cleanup on the other seed and neither move nor cleanup on the just added node. Is this OK? - What if I do not run cleanup in any existing node when adding or removing a node? Is the data that was not "cleaned up" still available i= f I send a scan, for instance, and the scan range is still in the node but i= t wouldn't be there if I had run cleanup? Data would be gather from other node, ie. the one that properly has the range specified in the scan quer= y? - After decommissioning a node, is it advisable to run cleanup in the remaining nodes? The consequences of not to run are the same of not to r= un when adding a node? Thank you very much in advance. Att. *Rodrigo Felix de Almeida* LSBD - Universidade Federal do Cear=E1 Project Manager MBA, CSM, CSPO, SCJP --047d7bf1641023d6f804e0ddf708 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Hi,

=A0 = =A0I'm facing some problems and if you could help on some of them I'= ;d thank you.
=A0 =A0Environment: 2 seeds and 2 othe= r nodes, all installed on m1.large EC2 instances. Each seed starts with abo= ut 1.7GB of data. Default cassandra configuration.
  • Is it normal to take about 9 minutes to add = a new node? Follows the log generated by a script to add a new node.
[06/07/2013 20:07:53] Remove all data stored in the Cassan= dra node
[06/07/2013 20:07:54] [OK] All data successfully removed
[06= /07/2013 20:07:54] Setting seeds on cassandra.yml
[06/07/2013 20:= 07:54] [OK] seeds successfully set
[06/07/2013 20:07:54] Setting = listen_address on cassandra.yml
[06/07/2013 20:07:54] [OK] listen_address successfully set
[= 06/07/2013 20:07:54] Setting initial_token on cassandra.yml
[06/0= 7/2013 20:07:54] [OK] initial_token successfully set
[06/07/20= 13 20:07:54] Starting cassandra...
[06/07/2013 20:16:36] [OK] Cassandra started
[06/07/2= 013 20:16:37] Changing token of i-5cfc082f
[06/07/2013 20:18:00] = [OK] Token of i-5cfc082f successfully set to 567137278201564105772291012386= 28035242
[06/07/2013 20:18:00] Cleaning up i-5cfc082f
[06/07/2013 20:= 20:13] Clean up of i-5cfc082f successfully finished
[06/07/2013 2= 0:20:13] Machine added
  • Is there a way = to reduce the time to start cassandra?
  • Sometimes cleanup operation takes make minutes (about 10). Is thi= s normal since the amount of data is small (1.7gb at maximum / seed)?
  • <= li style>Considering that I have two seeds in the beginning, their tokens a= re 0 and=A085070591730234615865843651857942052864. When I add a new machine= , do I need to execute move and cleanup on both seeds? Nowadays, I'm ru= nning cleanup on seed 0, move + cleanup on the other seed and neither move = nor cleanup on the just added node. Is this OK?
  • What if I do not run cleanup in any existing node when adding or = removing a node? Is the data that was not "cleaned up" still avai= lable if I send a scan, for instance, and the scan range is still in the no= de but it wouldn't be there if I had run cleanup? Data would be gather = from other node, ie. the one that properly has the range specified in the s= can query?
  • After decommissioning a node, is it advisable to run cleanup in t= he remaining nodes? The consequences of not to run are the same of not to r= un when adding a node?
=A0 =A0Thank you very much in ad= vance.

Att.

Rodrigo Felix de Almeida
LS= BD - Universidade Federal do Cear=E1
Project Manager
MBA, CSM, CSPO, = SCJP
--047d7bf1641023d6f804e0ddf708--