Message-ID: <53448AEB.8090204@elyograg.org>
Date: Tue, 08 Apr 2014 17:48:59 -0600
From: Shawn Heisey
To: solr-user@lucene.apache.org
Subject: Re: Investigating performance issues in solr cloud

On 4/8/2014 5:30 PM, Utkarsh Sengar wrote:
> I see a sudden drop in throughput once every 3-4 days. The "downtime" is for
> about 2-6 minutes and things stabilize after that.
>
> But I am not sure what is causing the problem.
>
> I have 3 shards with 20GB of data on each shard.
> Solr dashboard: http://i.imgur.com/6RWT2Dj.png
> Newrelic graphs during the downtime of about 4 hours:
> http://i.imgur.com/9vhKiB2.png
> JVM memory graph says it's normal: http://i.imgur.com/pAycgdC.png
>
> I thought it was GC pauses but it should be in the newrelic logs.
>
> How can I go about investigating this problem? I am running solr 4.4.0, I
> don't see a strong reason to upgrade yet.

Lots of questions:

How many total machines? What is your replicationFactor? Does each
machine have one shard replica and therefore 20GB of total index data,
or if you add up all the index directories for the cores on each
machine, is there more than 20GB of data?

What options are you passing to your JVM when you start the servlet
container that runs Solr?

The dashboard says that this machine has 24GB of RAM and a 9GB heap. Is
this the case for all machines? Is there any software other than Solr
on the machine?

If it's a linux/unix machine, can you run top, press shift-M to sort by
memory, and grab a screenshot? If it's a Windows machine, a similar
list should be available in the task manager, but it must include all
processes for all users on the whole machine, and it would be best if it
showed virtual memory as well as private.

Thanks,
Shawn
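
The memory figures in this thread (24GB of RAM, a 9GB heap, 20GB of index per shard) can be turned into a rough estimate of how much RAM is left for the operating system's disk cache. The sketch below is only an illustration: the function name and the `other_processes_gb` parameter are hypothetical, it assumes the JVM uses no more than its configured heap (in practice it uses somewhat more), and it assumes one 20GB shard per machine, which is exactly what the questions above are trying to confirm.

```python
def page_cache_headroom_gb(total_ram_gb, heap_gb, index_gb, other_processes_gb=0.0):
    """Estimate RAM left for the OS disk cache and the share of the index it could hold.

    Assumption: everything not taken by the Java heap or other processes
    is available to the OS for caching index files.
    """
    free_for_cache = total_ram_gb - heap_gb - other_processes_gb
    # Clamp to [0, 1]: the cache cannot hold less than nothing or more than the whole index.
    cacheable_fraction = min(1.0, max(0.0, free_for_cache) / index_gb)
    return free_for_cache, cacheable_fraction


# Figures from the thread: 24GB RAM, 9GB heap, 20GB of index on the machine.
cache_gb, fraction = page_cache_headroom_gb(total_ram_gb=24, heap_gb=9, index_gb=20)
print(f"{cache_gb:.0f} GB left for disk cache, about {fraction:.0%} of the index")
```

If a machine turns out to hold more than one replica, or runs other software, passing a larger `index_gb` or a non-zero `other_processes_gb` shows how quickly the cacheable share shrinks.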