Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 63E77200B85 for ; Thu, 15 Sep 2016 12:26:10 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 6286F160AB7; Thu, 15 Sep 2016 10:26:10 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 7D154160AB5 for ; Thu, 15 Sep 2016 12:26:09 +0200 (CEST) Received: (qmail 18919 invoked by uid 500); 15 Sep 2016 10:26:03 -0000 Mailing-List: contact dev-help@couchdb.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@couchdb.apache.org Delivered-To: mailing list dev@couchdb.apache.org Received: (qmail 18903 invoked by uid 99); 15 Sep 2016 10:26:03 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 15 Sep 2016 10:26:03 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id E9862C292A for ; Thu, 15 Sep 2016 10:26:02 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.379 X-Spam-Level: ** X-Spam-Status: No, score=2.379 tagged_above=-999 required=6.31 tests=[HEADER_FROM_DIFFERENT_DOMAINS=0.001, HTML_MESSAGE=2, NO_DNS_FOR_FROM=0.379, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id BykFpoLVj0P7 for ; Thu, 15 Sep 2016 10:25:59 +0000 (UTC) Received: from smtp.justsomehost.net (smtp.justsomehost.net [204.11.51.157]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id 4B86D5F246 for ; Thu, 15 Sep 2016 10:25:58 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by smtp.justsomehost.net (Postfix) with ESMTP id B911B562C9E for ; Thu, 15 Sep 2016 06:23:59 -0400 (EDT) Received: from smtp.justsomehost.net ([127.0.0.1]) by localhost (smtp.justsomehost.net [127.0.0.1]) (amavisd-new, port 10032) with ESMTP id zdZU_fHyzmFx for ; Thu, 15 Sep 2016 06:23:58 -0400 (EDT) Received: from localhost (localhost [127.0.0.1]) by smtp.justsomehost.net (Postfix) with ESMTP id 36E2D562D58 for ; Thu, 15 Sep 2016 06:23:58 -0400 (EDT) X-Virus-Scanned: amavisd-new at smtp.justsomehost.net Received: from smtp.justsomehost.net ([127.0.0.1]) by localhost (smtp.justsomehost.net [127.0.0.1]) (amavisd-new, port 10026) with ESMTP id PWP3M0TOw8WA for ; Thu, 15 Sep 2016 06:23:58 -0400 (EDT) Received: from smtp.justsomehost.net (smtp.justsomehost.net [204.11.51.157]) by smtp.justsomehost.net (Postfix) with ESMTP id 07270562C9E for ; Thu, 15 Sep 2016 06:23:58 -0400 (EDT) Date: Thu, 15 Sep 2016 06:23:57 -0400 (EDT) From: Joan Touzet Reply-To: Joan Touzet To: dev@couchdb.apache.org Message-ID: <1387326.1412.1473935144913.JavaMail.Joan@RITA> In-Reply-To: References: <2883838.505.1473451976224.JavaMail.Joan@RITA> <25200628.1118.1473824952732.JavaMail.Joan@RITA> <27966981.1153.1473829264061.JavaMail.Joan@RITA> Subject: Re: High CPU usage caused by clustering and/or replication? MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_1411_5650592.1473935144909" X-Originating-IP: [204.11.51.157] X-Mailer: Zimbra 8.6.0_GA_1194 (Zimbra Desktop/7.2.8_12101_Windows) Thread-Topic: High CPU usage caused by clustering and/or replication? Thread-Index: AQHSDYgcPrswn7modEmrLcCn02LcZKBxkY4AgAa5dG6Tnk4fsOxhwqeLPTxXrBP+Fo+ctY7rlHWU archived-at: Thu, 15 Sep 2016 10:26:10 -0000 ------=_Part_1411_5650592.1473935144909 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit Best I can suggest at this point is to send on your logfiles, and if you have any data that helps reproduce the problem you are experiencing that you can share, that'd be great, too. If you can provide this data, please file a new ticket at https://issues.apache.org/jira/secure/CreateIssue!default.jspa selecting CouchDB as the project. -Joan ----- Original Message ----- > From: "Joey Samonte" > To: dev@couchdb.apache.org, "Joan Touzet" > Sent: Wednesday, September 14, 2016 6:59:12 AM > Subject: Re: High CPU usage caused by clustering and/or replication? > Same thing is happening on a Linux cluster. :( It even causes the > beam process to crash. > I am using free tier instances on AWS, with 1 Core and 1GB RAM. Would > that cause this issue? > From: Joan Touzet > Sent: Wednesday, September 14, 2016 4:59 AM > To: dev@couchdb.apache.org > Subject: Re: High CPU usage caused by clustering and/or replication? > I would recommend that you try and reproduce the scenario you're > running into with a Linux-based cluster, and if you find it there > as well, file a bug with us so we can try and work your problem out. > If the problem goes away, then yes, I recommend for now you consider > a > Linux cluster. ;) > I also recommend you use a minimum of 3 nodes for a cluster of q=8 > (the > default). > ----- Original Message ----- > > From: "Joey Samonte" > > To: dev@couchdb.apache.org, "Joan Touzet" > > Sent: Wednesday, September 14, 2016 12:04:04 AM > > Subject: Re: High CPU usage caused by clustering and/or > > replication? > > > > I actually stopped all replications and the high CPU occurs on the > > 2nd node. I have only 2 nodes in the cluster. > > > > We really want to move to CouchDB 2.0, but we can't afford this > > issue > > I am encountering. > > > > Do you suggest we just use Linux-based cluster? :( > > > > > > > > ________________________________ > > From: Joan Touzet > > Sent: Wednesday, September 14, 2016 3:47 AM > > To: dev@couchdb.apache.org > > Subject: Re: High CPU usage caused by clustering and/or > > replication? > > > > Joey, > > > > I'm sorry, I can't reproduce this problem. If you're able to > > provide > > sample data that causes this issue, I'd be pleased to test (on a > > non-virtualized Windows machine) with that. > > > > I will note that I am only testing with a local single CouchDB > > server, > > not in a cluster. What if you try replicating to a non-clustered > > Windows server? We have absolutely zero testing done on CouchDB > > clustered on Windows, and in fact really should declare our support > > for > > that configuration as beta. I will state I have not set up a local > > Windows-based cluster (on separate Windows machines; I have run a 3 > > node-on-1-server development cluster setup) so I can't guarantee > > that > > this isn't the problem. > > > > Please note this Known Issue in CouchDB 2.0: > > https://issues.apache.org/jira/browse/COUCHDB-2980 > > > > To quote our Known Issues document: "for now, we recommend always > > using > > the URL format for both source and target specifications." > > > > Finally, you should know it was my decision to continue Windows > > support > > for CouchDB, as no one else was able to step up. I did so primarily > > because, in my experience, we have developers who develop against a > > local CouchDB instance on their Windows laptops, then move to a > > Linux-hosted cluster for server use. I didn't expect people would > > be > > trying to leverage the Windows port in a real server configuration. > > > > That's not to say we couldn't eventually support Windows in a > > server > > cluster configuration as "out of beta," but it'll take more > > experience > > and more repeatable bugs at the very least - and active CouchDB > > Windows > > developers with performance tuning experience at best. > > > > -Joan > > > > ----- Original Message ----- > > > From: "Joey Samonte" > > > To: dev@couchdb.apache.org, "Joan Touzet" > > > Sent: Tuesday, September 13, 2016 10:53:18 PM > > > Subject: Re: High CPU usage caused by clustering and/or > > > replication? > > > > > > I have tried to stop all replications, then restarted CouchDB, > > > but > > > still having almost 100% CPU with erl.exe > > > > > > > > > ________________________________ > > > From: Joan Touzet > > > Sent: Friday, September 9, 2016 8:11 PM > > > To: dev@couchdb.apache.org > > > Subject: Re: High CPU usage caused by clustering and/or > > > replication? > > > > > > I don't know, a simple view with 100 documents should build > > > quickly > > > unless your view code is unusual. > > > > > > I would recommend replicating the database to a UNIX-based 2.0 > > > cluster > > > to see if the problem is Windows-specific. > > > > > > Sorry that I don't have any other ideas at the moment. > > > > > > -Joan > > > > > > ----- Original Message ----- > > > > From: "Joey Samonte" > > > > To: dev@couchdb.apache.org > > > > Sent: Friday, September 9, 2016 3:08:37 PM > > > > Subject: RE: High CPU usage caused by clustering and/or > > > > replication? > > > > > > > > Hi ma'am Joan, > > > > > > > > But if the cause is a view being rebuilt, why is it so high? I > > > > only > > > > have less than 100 documents on that database which I wanted to > > > > test > > > > initially. > > > > > > > > > Date: Fri, 9 Sep 2016 07:10:35 -0400 > > > > > From: wohali@apache.org > > > > > To: dev@couchdb.apache.org > > > > > Subject: Re: High CPU usage caused by clustering and/or > > > > > replication? > > > > > > > > > > Hi Joey, > > > > > > > > > > I'm going to oversimplify a bit here to make the explanation > > > > > easier. > > > > > > > > > > If you check the process manager/top on each node, do you see > > > > > couchjs > > > > > responsible for that 100% CPU? > > > > > > > > > > Remember that views are not replicated, only documents. So if > > > > > you > > > > > have > > > > > design documents that you replicated, those views may be > > > > > being > > > > > rebuilt. > > > > > > > > > > So-called "internal replication" will be placing segments of > > > > > the > > > > > database > > > > > onto each node, and view building will be happening on each > > > > > node > > > > > as > > > > > well. > > > > > Since the view is also sharded, each shard of the database > > > > > gets > > > > > its > > > > > own > > > > > view shard; that has to be calculated independently on each > > > > > node. > > > > > > > > > > Even if you're not building views, remember that a cluster > > > > > places > > > > > part > > > > > of the database on each node, spreading the load out as > > > > > equally > > > > > as > > > > > possible. In a standard 3-node cluster, each document write > > > > > will > > > > > be > > > > > written redundantly to every node, meaning CPU load will be > > > > > (mostly) > > > > > equal across all 3 nodes as you write that document. > > > > > > > > > > Hope this helps, > > > > > Joan > > > > > > > > > > > > > > > ----- Original Message ----- > > > > > > From: "Joey Samonte" > > > > > > To: dev@couchdb.apache.org > > > > > > Sent: Thursday, September 8, 2016 11:30:46 PM > > > > > > Subject: High CPU usage caused by clustering and/or > > > > > > replication? > > > > > > > > > > > > Good day, > > > > > > > > > > > > I have setup a cluster with two nodes, and I am replicating > > > > > > the > > > > > > first > > > > > > node to an existing database through a loadbalancer. But I > > > > > > am > > > > > > getting a 100% CPU usage after a few minutes on both nodes. > > > > > > Is > > > > > > this > > > > > > an issue with RC4? > > > > > > > > > > > > Thanks. > > > > > > > > > > > > > > > ------=_Part_1411_5650592.1473935144909--