Date: Mon, 3 Nov 2014 19:27:17 +0200
Subject: Re: Solr slow start up (tlog is small)
From: Dmitry Kan
To: "solr-user@lucene.apache.org"

One other reason for a slow start-up can be a large number of segments in the
index, which I'm guessing is not the case since you optimized? But anyway,
what's the number of segments in both the 280G and 185G indices?

Dmitry

On Mon, Nov 3, 2014 at 6:17 PM, Yonik Seeley wrote:

> Can you tell from the logs what Solr is doing during that time?
> Do you have any warming queries configured?
> Also see this: https://issues.apache.org/jira/browse/SOLR-6679
> (comment out suggester related stuff if you aren't using it)
>
> -Yonik
> http://heliosearch.org - native code faceting, facet functions,
> sub-facets, off-heap data
>
>
> On Mon, Nov 3, 2014 at 11:03 AM, Po-Yu Chuang wrote:
> > Hi,
> >
> > I am using Solr 4.9 with Tomcat and it works fine except that the
> > deployment of solr.war is too long. While deploying Solr, all webapps on
> > Tomcat stop responding which is unacceptable. Most articles I found say
> > that it might result from big transaction log because of uncommitted
> > documents, but this is not my case.
> >
> > At first, the Solr data is 280G and the start up time is 30 minutes. Then I
> > set a field to stored="false" and re-index whole data. The data size became
> > 185G and the start up time reduced to 17 minutes, but it is still too long.
> >
> > Here are some numbers I measured:
> >
> > 1)
> > Solr home: 280G
> > tlog: 500K
> > 30 min to start up
> > While starting up, disk read is constantly about 50MB/s (according to
> > dstat). So it seems that Solr reads 30m * 60s * 50MB/s = 90GB of data while
> > starting up, which is 30% of index data size.
> >
> > 2)
> > Solr home: 185G
> > tlog: 5M
> > 17 minutes to start up
> > While starting up, disk read is constantly about 5MB/s (according to
> > dstat). So it seems that Solr reads 17m * 60s * 5MB/s = 5GB of data while
> > starting up, which is about 3% of index data size.
> >
> > p.s. I did commit each time 1000 documents being added and did optimization
> > after all documents are added.
> >
> > Any ideas or suggestions would be appreciated.
> >
> > Thanks,
> > Po-Yu
>

-- 
Dmitry Kan
Luke Toolbox: http://github.com/DmitryKey/luke
Blog: http://dmitrykan.blogspot.com
Twitter: http://twitter.com/dmitrykan
SemanticAnalyzer: www.semanticanalyzer.info
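
For anyone who wants to redo the back-of-the-envelope estimate quoted above, here is a minimal sketch of the same arithmetic. It assumes the dstat read rate stayed constant for the whole start-up, that sizes are in decimal units (1 GB = 1000 MB), and that the "Solr home" size is the index size, as the original message does. The function names are made up for illustration.

```python
def estimated_read_gb(startup_minutes, read_mb_per_s):
    """Data read during start-up, in GB (assumes 1 GB = 1000 MB)."""
    return startup_minutes * 60 * read_mb_per_s / 1000


def fraction_of_index(read_gb, index_gb):
    """Share of the index that the estimated read volume corresponds to."""
    return read_gb / index_gb


# (label, index size in GB, start-up minutes, dstat read rate in MB/s),
# taken from the two measurements in the quoted message.
measurements = [
    ("280G index", 280, 30, 50),
    ("185G index", 185, 17, 5),
]

for label, index_gb, minutes, rate in measurements:
    read_gb = estimated_read_gb(minutes, rate)
    share = fraction_of_index(read_gb, index_gb)
    print(f"{label}: ~{read_gb:.1f} GB read, {share:.1%} of the index")
```

The two cases differ by about a factor of ten in read rate, so the share of the index touched during start-up is very different even though both start-ups are slow.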