Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3D42B115A0 for ; Tue, 5 Aug 2014 13:58:02 +0000 (UTC) Received: (qmail 34436 invoked by uid 500); 5 Aug 2014 13:57:58 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 34397 invoked by uid 500); 5 Aug 2014 13:57:57 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 34386 invoked by uid 99); 5 Aug 2014 13:57:57 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 05 Aug 2014 13:57:57 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ruchir.jha@gmail.com designates 209.85.217.178 as permitted sender) Received: from [209.85.217.178] (HELO mail-lb0-f178.google.com) (209.85.217.178) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 05 Aug 2014 13:57:53 +0000 Received: by mail-lb0-f178.google.com with SMTP id c11so745422lbj.23 for ; Tue, 05 Aug 2014 06:57:32 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=8UVlj8JX1u4EJesKvAL5KX/xjuo9ZUj2GuiqZyDwI9I=; b=VXDShPSwkjjn9nrW7SdElVop6v+7KLSeRnxG76JhjdKcabDCRVbtlSQUhx8H3J2AU1 q8maPm0dp5uFX+aQ0zpq8Xwjzd6gh8YHWr5tQinOhhwtnjv+2BXcE0Mnowkn6STX44RC pQHwVY8VKRu0fiX3ubb08gvFUlxAxZitukg38DGFtDjNLtsF0jgWWWpaIb7KT3EHnbSW V9aePF2uteGt90Zx5Cyx8Hby75BhlELQOkCuTy1mrY9kaqsjaoH1wflJ0vDq0mQxzPah JM9Yqy0qKXP6RzZTtl8DnP63kRNashZhHbRsIA1Ou6xHe8XB8I2D3yTGsvsu9MuOWlz8 6B3g== MIME-Version: 1.0 X-Received: by 10.112.204.164 with SMTP id kz4mr4109274lbc.15.1407247052018; Tue, 05 Aug 2014 06:57:32 -0700 (PDT) Received: by 10.114.79.233 with HTTP; Tue, 5 Aug 2014 06:57:31 -0700 (PDT) In-Reply-To: References: Date: Tue, 5 Aug 2014 09:57:31 -0400 Message-ID: Subject: Re: Node bootstrap From: Ruchir Jha To: "user@cassandra.apache.org" Content-Type: multipart/alternative; boundary=001a11c3bfaaa59ccc04ffe23c0c X-Virus-Checked: Checked by ClamAV on apache.org --001a11c3bfaaa59ccc04ffe23c0c Content-Type: text/plain; charset=UTF-8 Thanks Patricia for your response! On the new node, I just see a lot of the following: INFO [FlushWriter:75] 2014-08-05 09:53:04,394 Memtable.java (line 400) Writing Memtable INFO [CompactionExecutor:3] 2014-08-05 09:53:11,132 CompactionTask.java (line 262) Compacted 12 sstables to so basically it is just busy flushing, and compacting. Would you have any ideas on why the 2x disk space blow up. My understanding was that if initial_token is left empty on the new node, it just contacts the heaviest node and bisects its token range. And the heaviest node is around 2.1 TB, and the new node is already at 4 TB. Could this be because compaction is falling behind? Ruchir On Mon, Aug 4, 2014 at 7:23 PM, Patricia Gorla wrote: > Ruchir, > > What exactly are you seeing in the logs? Are you running major compactions > on the new bootstrapping node? > > With respect to the seed list, it is generally advisable to use 3 seed > nodes per AZ / DC. > > Cheers, > > > On Mon, Aug 4, 2014 at 11:41 AM, Ruchir Jha wrote: > >> I am trying to bootstrap the thirteenth node in a 12 node cluster where >> the average data size per node is about 2.1 TB. The bootstrap streaming has >> been going on for 2 days now, and the disk size on the new node is already >> above 4 TB and still going. Is this because the new node is running major >> compactions while the streaming is going on? >> >> One thing that I noticed that seemed off was the seeds property in the >> yaml of the 13th node comprises of 1..12. Where as the seeds property on >> the existing 12 nodes consists of all the other nodes except the thirteenth >> node. Is this an issue? >> >> Any other insight is appreciated? >> >> Ruchir. >> >> >> > > > -- > Patricia Gorla > @patriciagorla > > Consultant > Apache Cassandra Consulting > http://www.thelastpickle.com > --001a11c3bfaaa59ccc04ffe23c0c Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Thanks Patricia for your response!

On the new node, I just see a lot of the following:

INFO [FlushWriter:75] 2014-08-05 09:53:04,394 Memtable.java (line = 400) Writing Memtable
INFO [CompactionExecutor:3] 2014-08-05 09:53:11,132 CompactionTask.ja= va (line 262) Compacted 12 sstables to

so basically = it is just busy flushing, and compacting. Would you have any ideas on why t= he 2x disk space blow up. My understanding was that if initial_token is lef= t empty on the new node, it just contacts the heaviest node and bisects its= token range. And the heaviest node is around 2.1 TB, and the new node is a= lready at 4 TB. Could this be because compaction is falling behind?

Ruchir


On Mon, Aug 4, 2014 at 7:23 PM, Patricia Gorla <patricia@thelastpickle.com> wrote:
Ruchir,

= What exactly are you seeing in the logs? Are you running major compactions = on the new bootstrapping node?

With respect to the seed list, it is generally advisabl= e to use 3 seed nodes per AZ / DC.

Cheers,


On Mon, Aug 4, 2014 at 11:= 41 AM, Ruchir Jha <ruchir.jha@gmail.com> wrote:
I am trying to bootstrap th= e thirteenth node in a 12 node cluster where the average data size per node= is about 2.1 TB. The bootstrap streaming has been going on for 2 days now,= and the disk size on the new node is already above 4 TB and still going. I= s this because the new node is running major compactions while the streamin= g is going on? =C2=A0

One thing that I noticed that seemed off was the seeds prope= rty in the yaml of the 13th node comprises of 1..12. Where as the seeds pro= perty on the existing 12 nodes consists of all the other nodes except the t= hirteenth node. Is this an issue?

Any other insight is appreciated?

Ruchir.




--
Patricia Go= rla
@patriciagorla

Consultant
Apache= Cassandra Consulting

--001a11c3bfaaa59ccc04ffe23c0c--