Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C5781E52E for ; Sat, 24 Nov 2012 15:15:18 +0000 (UTC) Received: (qmail 55679 invoked by uid 500); 24 Nov 2012 15:15:15 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 55565 invoked by uid 500); 24 Nov 2012 15:15:14 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 55535 invoked by uid 99); 24 Nov 2012 15:15:13 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 24 Nov 2012 15:15:13 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of hsiao.chuanheng@gmail.com designates 209.85.219.44 as permitted sender) Received: from [209.85.219.44] (HELO mail-oa0-f44.google.com) (209.85.219.44) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 24 Nov 2012 15:15:07 +0000 Received: by mail-oa0-f44.google.com with SMTP id n5so10554611oag.31 for ; Sat, 24 Nov 2012 07:14:47 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=2Sb//8LxNcfE3/ZeOV/t3QoZBAlVflTnl3WUxEnlKpo=; b=A1Q+pmUeynS9gBNMGcGKsouKyW6YWn62CqkYv4kqFCp8/4wtbQmN1Kii49QWg9B6Hl 0d3IAXiTsKkNv5Sx0RFDEvjHaBnTJ5YtESMqo5sm5kot7WplzDTpTwMHO32wPeBDe0HV 1MNETImSaUie5wF9YqFF5m9RO75ArGgxg2sk9Sd8Nm4rcjsG2mR6u1Bc+OuWqReTNlD0 nRezRoCIqR69MF9iim6exPBrUsMgPjljaHjlncHNQXKqeai5QWqo4YmTtB9KWlZ4H2/a uj4aSNTAEeg9mDVGJjfOqici+S1cJ/9oDmTzWk6Kbm4Un901BHeAPUWHalFSSg6lkrlc fUcQ== Received: by 10.60.28.36 with SMTP id y4mr5461523oeg.13.1353770086903; Sat, 24 Nov 2012 07:14:46 -0800 (PST) MIME-Version: 1.0 Received: by 10.76.113.232 with HTTP; Sat, 24 Nov 2012 07:14:26 -0800 (PST) In-Reply-To: References: From: Chuan-Heng Hsiao Date: Sat, 24 Nov 2012 23:14:26 +0800 Message-ID: Subject: Re: huge commitlog To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org Hi Cassandra Devs, After trying to setup the same settings (and importing same data) to the 3 VMs on the same machine instead of 3 physical machines, so far I couldn't replicate the exploded-commitlog situation. On my 4-physical-machine setting, everything seems to be back to normal (commitlog size is less than the expected max setting) after restarting the nodes. This time the size of the commitlog of one node is set as 4G, and the others are set as 8G. Few days ago the node with 4G got exploded as 5+G. (the 8G nodes remain at 8G). I checked the log, and found some ERROR about network problems, and some ERROR about "Keys must not be empty". I suspect that besides the network problems, the "Keys must not be empty" ERROR may be the main reason why the commitlog continues growing. (I've already ensured that the Keys must not be empty in my code, so the problem may be raised when syncing internally in cassandra.) I restarted the 4G node as 8G node. Because there is no huge traffic since then, I am not sure whether increasing the commitlog size will solve/reduce this problem or not yet. I'll keep you posted once the commitlog get expldoed again. Sincerely, Hsiao On Mon, Nov 19, 2012 at 11:21 AM, Chuan-Heng Hsiao wrote: > I have RF = 3. Read/Write consistency has already been set as TWO. > > It did seem that the data were not consistent yet. > (There are some CFs that I expected empty after the operations, but still > got some data, and the number of data were decreasing after retrying > to get all data > from that CF) > > Sincerely, > Hsiao > > > On Mon, Nov 19, 2012 at 11:14 AM, Tupshin Harper wrote: >> What consistency level are you writing with? If you were writing with ANY, >> try writing with a higher consistency level. >> >> -Tupshin >> >> On Nov 18, 2012 9:05 PM, "Chuan-Heng Hsiao" >> wrote: >>> >>> Hi Aaron, >>> >>> Thank you very much for the replying. >>> >>> The 700 CFs were created in the beginning (before any insertion.) >>> >>> I did not do anything with commitlog_archiving.properties, so I guess >>> I was not using commit log archiving. >>> >>> What I did was doing a lot of insertions (and some deletions) >>> using another 4 machines with 32 processes in total. >>> (There are 4 nodes in my setting, so there are 8 machines in total) >>> >>> I did see huge logs in /var/log/cassandra after such huge amount of >>> insertions. >>> Right now I can't distinguish whether single insertion also cause huge >>> logs. >>> >>> nodetool flush hanged (maybe because of 200G+ commitlog) >>> >>> Because these machines are not in production (guaranteed no more >>> insertion/deletion) >>> I ended up restarting cassandra one node each time, the commitlog >>> shrinked back to >>> 4G. I am doing repair on each node now. >>> >>> I'll try to re-import and keep logs when the commitlog increases insanely >>> again. >>> >>> Sincerely, >>> Hsiao >>> >>> >>> On Mon, Nov 19, 2012 at 3:19 AM, aaron morton >>> wrote: >>> > I am wondering whether the huge commitlog size is the expected behavior >>> > or >>> > not? >>> > >>> > Nope. >>> > >>> > Did you notice the large log size during or after the inserts ? >>> > If after did the size settle ? >>> > Are you using commit log archiving ? (in commitlog_archiving.properties) >>> > >>> > and around 700 mini column family (around 10M in data_file_directories) >>> > >>> > Can you describe how you created the 700 CF's ? >>> > >>> > and how can we reduce the size of commitlog? >>> > >>> > As a work around nodetool flush should checkpoint the log. >>> > >>> > Cheers >>> > >>> > ----------------- >>> > Aaron Morton >>> > Freelance Cassandra Developer >>> > New Zealand >>> > >>> > @aaronmorton >>> > http://www.thelastpickle.com >>> > >>> > On 17/11/2012, at 2:30 PM, Chuan-Heng Hsiao >>> > wrote: >>> > >>> > hi Cassandra Developers, >>> > >>> > I am experiencing huge commitlog size (200+G) after inserting huge >>> > amount of data. >>> > It is a 4-node cluster with RF= 3, and currently each has 200+G commit >>> > log (so there are around 1T commit log in total) >>> > >>> > The setting of commitlog_total_space_in_mb is default. >>> > >>> > I am using 1.1.6. >>> > >>> > I did not do nodetool cleanup and nodetool flush yet, but >>> > I did nodetool repair -pr for each column family. >>> > >>> > There is 1 huge column family (around 68G in data_file_directories), >>> > and 18 mid-huge column family (around 1G in data_file_directories) >>> > and around 700 mini column family (around 10M in data_file_directories) >>> > >>> > I am wondering whether the huge commitlog size is the expected behavior >>> > or >>> > not? >>> > and how can we reduce the size of commitlog? >>> > >>> > Sincerely, >>> > Hsiao >>> > >>> >