From: "Jack Krupansky"
Reply-To: java-user@lucene.apache.org
Subject: Re: search performance
Date: Mon, 2 Jun 2014 07:34:49 -0400
Message-ID: <14714CF93FA04417B7B936078D373869@JackKrupansky14>
In-Reply-To: <538C1F0F.3010300@mailarchiva.com>

Do
you have enough system memory for the OS to cache the entire index, instead of thrashing with I/O? Do you see a lot of I/O or are the queries compute-bound?

You said you have a 128GB machine, so that sounds small for your index. Have you tried a 256GB machine?

How frequent are your commits for updates while doing queries?

-- Jack Krupansky

-----Original Message-----
From: Jamie
Sent: Monday, June 2, 2014 2:51 AM
To: java-user@lucene.apache.org
Subject: search performance

Greetings

Despite following all the recommended optimizations (as described at http://wiki.apache.org/lucene-java/ImproveSearchingSpeed), in some of our installations, search performance has reached the point where it is unacceptably slow. For instance, in one environment, the total index size is 200GB, with 150 million documents indexed. With NRT enabled, a search takes roughly 5 minutes on average. The server resources are: 2x 6-core Intel CPUs, 128GB RAM, 2 SSDs (RAID 0) for the index, running Linux.

The only thing we haven't yet done is to upgrade Lucene from 4.7.x to 4.8.x. Is this likely to make any noticeable difference in performance?

Clearly, longer term, we need to move to a distributed search model. We thought to take advantage of the distributed search features offered in Solr; however, our solution is very tightly integrated into Lucene directly (since Solr didn't exist when we started out). Moving to Solr now seems like a daunting prospect. We've also been following the Katta project with interest, but it doesn't appear to support distributed indexing, and development on it seems to have stalled. It would be nice if there were a distributed search project on the Lucene level that we could use.

I realize this is a rather vague question, but are there any further suggestions on ways to improve search performance? We need cheap and dirty ideas, as well as longer term advice on a possible path forward.

Much appreciated,
Jamie

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
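
The memory question in the reply comes down to simple arithmetic: compare the on-disk index size with the RAM left over for the OS page cache. The following is a minimal sketch of that check, not anything posted in the thread. The class name PageCacheCheck, its helper methods, and the 16 GiB reserved for the JVM heap and other processes are all hypothetical; the 200GB and 128GB figures come from the original message and are treated as GiB for simplicity.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.stream.Stream;

// Hypothetical helper: estimates whether an index could be fully cached by the OS.
final class PageCacheCheck {
    static final long GIB = 1L << 30;

    // Sum the sizes of all regular files under the index directory.
    static long directorySizeBytes(Path dir) throws IOException {
        try (Stream<Path> files = Files.walk(dir)) {
            return files.filter(Files::isRegularFile).mapToLong(p -> {
                try {
                    return Files.size(p);
                } catch (IOException e) {
                    throw new UncheckedIOException(e);
                }
            }).sum();
        }
    }

    // True if the index fits in the RAM remaining after reservedBytes
    // (JVM heap, other processes) has been set aside.
    static boolean fitsInPageCache(long indexBytes, long ramBytes, long reservedBytes) {
        return indexBytes <= ramBytes - reservedBytes;
    }

    public static void main(String[] args) throws IOException {
        // With no argument, fall back to the figures quoted in the thread.
        long indexBytes = args.length > 0
                ? directorySizeBytes(Paths.get(args[0]))
                : 200 * GIB;
        long ramBytes = 128 * GIB;      // machine size quoted in the thread
        long reservedBytes = 16 * GIB;  // assumed figure, adjust for your setup
        System.out.printf("index=%d GiB, cacheable=%d GiB, fits=%b%n",
                indexBytes / GIB,
                (ramBytes - reservedBytes) / GIB,
                fitsInPageCache(indexBytes, ramBytes, reservedBytes));
    }
}
```

Passing an index directory as the first argument measures a real index instead of using the quoted 200GB figure. A result of false only suggests that some queries may have to go to disk; it says nothing about whether the workload is I/O-bound or compute-bound, which still has to be measured.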