Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7844511B8D for ; Fri, 23 May 2014 23:52:49 +0000 (UTC) Received: (qmail 95388 invoked by uid 500); 23 May 2014 23:52:48 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 95326 invoked by uid 500); 23 May 2014 23:52:48 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 95318 invoked by uid 99); 23 May 2014 23:52:48 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 23 May 2014 23:52:48 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of vfunstein@gmail.com designates 209.85.212.176 as permitted sender) Received: from [209.85.212.176] (HELO mail-wi0-f176.google.com) (209.85.212.176) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 23 May 2014 23:52:43 +0000 Received: by mail-wi0-f176.google.com with SMTP id n15so1560458wiw.9 for ; Fri, 23 May 2014 16:52:22 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=8Jvy/z/qO9bc7c8BDMYkJ98J/LoIRVMGosPl30GNa0o=; b=gM51lllNVX057lIUHCaBIvXyxmdaZP+VZ6mvKWGTwHyryqZH1xOWXvzJ7qTgQScox2 by3ohC5rxj/MRl61IvbA/6nRSXKFB/polRT9qjGo3OKr7vdSg1oEfuaYYNLH2QhT5Zmt Jxo7a4FBiWtveoMQLlRPZPb+9BqTi+SprU0+6zQQMvvbHQtq/CY1VgticGau/mIbX/YR uPC6KxW7CiDMSurSUblsy1GNY83fOcpLpup69sGjw1bX2NsccjOzIVyC6XQSR45V1I4+ uTm2oop1B6SsOcnZaItrU/mLYQy2N7Opl+E21l8XI/M5aZe0FLmKgmn4NQYxyAvIMkl9 5YMg== MIME-Version: 1.0 X-Received: by 10.180.36.138 with SMTP id q10mr6535480wij.4.1400889142562; Fri, 23 May 2014 16:52:22 -0700 (PDT) Received: by 10.194.62.206 with HTTP; Fri, 23 May 2014 16:52:22 -0700 (PDT) In-Reply-To: <01AFE0FB733B9944974A82A09CEB7A0309C835D36D@mail3.imedx.com> References: <01AFE0FB733B9944974A82A09CEB7A0309C81ABB21@mail3.imedx.com> <1400570841.2420.155.camel@te-prime> <01AFE0FB733B9944974A82A09CEB7A0309C835D126@mail3.imedx.com> <1400578244.2420.170.camel@te-prime> <01AFE0FB733B9944974A82A09CEB7A0309C835D16A@mail3.imedx.com> <1400581087.2420.182.camel@te-prime> <01AFE0FB733B9944974A82A09CEB7A0309C835D36D@mail3.imedx.com> Date: Fri, 23 May 2014 16:52:22 -0700 Message-ID: Subject: Re: NewBie To Lucene || Perfect configuration on a 64 bit server From: Vitaly Funstein To: java-user@lucene.apache.org Content-Type: multipart/alternative; boundary=e89a8f502eeeb657c104fa19eb88 X-Virus-Checked: Checked by ClamAV on apache.org --e89a8f502eeeb657c104fa19eb88 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable At the risk of sounding overly critical here, I would say you need to scrap your entire approach of building one small index per request, and just build your entire searchable data store in Lucene/Solr. This is the simplest and probably most maintainable and scalable solution. Even if your index contains 10M+ documents, returning at most 500 search results should be lightning fast compared to the latencies you're seeing right now. To facilitate data export from the DB, take a look at this: http://wiki.apache.org/solr/DataImportHandler On Tue, May 20, 2014 at 7:36 AM, Shruthi wrote: > > > > > -----Original Message----- > From: Toke Eskildsen [mailto:te@statsbiblioteket.dk] > Sent: Tuesday, May 20, 2014 3:48 PM > To: java-user@lucene.apache.org > Subject: Re: NewBie To Lucene || Perfect configuration on a 64 bit server > > On Tue, 2014-05-20 at 11:56 +0200, Shruthi wrote: > > Toke: > > Is 20 second an acceptable response time for your users? > > > > Shruthi: Its definitely not acceptable. PFA the piece of code that we > > are using..Its taking 20seconds. That=E2=80=99s why I drafted this tick= et to > > see where I was going wrong. > > Indexing 1000 documents/sec in Lucene is quite common, so even taking > into account large documents, 20 seconds sounds like quite a bit. > Shruthi: I had attached the code snippet in previous mail. Do you suspect > a foul play there? > > > Shruthi: Well, its two stage process: Client is looking at > > historical data based on a parameters like names, dates,MRN, fields > > etc.. SO the query actually gets the data set fulfilling the > > requirements > > > > If client is interested in doing a text search then he would pass the > > search phrase on the result set. > > So it is not possible for a client to perform a broad phrase search to > start with. And it sounds like your DB-queries are all simple matching? > No complex joins and such? If so, this calls even more for a full > Lucene-index solution, which handles all aspect of the search process. > Shruthi: We call a DB stored procedure to get us the result set for > working with.. > We will be using highlighter API and I don=E2=80=99t think Memory index= can be > used with highlighter. > > > > - Toke Eskildsen, State and University Library, Denmark > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > --e89a8f502eeeb657c104fa19eb88--