From: "Eugene Tuan"
Reply-To: java-user@lucene.apache.org
Subject: RE: Size + memory restrictions
Date: Tue, 14 Feb 2006 10:38:31 -0800
Message-ID: <4E022DDAB8F45741914ACD6EDFE2309B02012909@BIGFOOT.kazeon.local>

Yes, we have the same problem. It is mainly because TermInfosReader.java takes up memory to keep the *.tii data.

Eugene

-----Original Message-----
From: Leon Chaddock [mailto:leonchaddock@macranet.co.uk]
Sent: Tuesday, February 14, 2006 8:43 AM
To: java-user@lucene.apache.org
Subject: Size + memory restrictions

Hi,

we are having tremendous problems building a large Lucene index and querying it.

The programmers are telling me that when the index file reaches 3.5 GB or 5 million docs, it can no longer grow any larger.

To rectify this they have built index files in multiple directories. Now apparently my 4 GB of memory is not enough to query.

Does this seem right to people, or does anyone have any experience with largish scale projects?

I am completely tearing my hair out here and don't know what to do.

Thanks

Leon

----- Original Message -----
From: "Chun Wei Ho"
To:
Sent: Monday, February 13, 2006 10:41 AM
Subject: Re: Suggesting refine searches with Lucene

Thanks. But I am actually looking for approaches/libraries which will help me to come up with the suggested "refine searches". For example, I might search for "accident" on the headlines at a news site, which would come back with lots of hits. I am looking for something that would analyze the headlines (or some other specified text field) of all those hits and come up with a list of refined searches that would each return a specific, considerable subset of the results, e.g. "Traffic", "plane", "boating", etc., these being frequent occurrences in the headline text of news items that include "accident" in the headline.

I guess it's a matter of finding frequently occurring subsequences with some intelligent guessing, but I was hoping that someone else had already done it better in a library that I could adapt.

Regards,
CW

On 2/13/06, Ravi wrote:
> Hi,
>
> I have implemented this using the Query "mergeBooleanQueries" method. In this
> approach I have created one POJO class, RefineQuery, which contains one
> variable of type Query, and I set it whenever I run a search.
> The next time, if it is a refined search, I merge the current query with
> the query held in the RefineQuery object, pass the merged query to Lucene,
> and store it back in the RefineQuery object. This is working fine. Let me
> know if you have any further ideas or any new technique to implement
> refined search.
>
> // Fragment: objSearchParameters, objRefineQuery, booleanQuery, searcher
> // and hits are defined elsewhere in the application.
> if (objSearchParameters.isBSearchInSearch()) {
>     // Refined search: combine the previous query with the new one.
>     Query q2 = Query.mergeBooleanQueries(
>         new Query[] { objRefineQuery.getQuery(), booleanQuery });
>     objRefineQuery.setQuery(q2);
>     hits = searcher.search(q2);
> } else {
>     // Fresh search: remember the query for later refinement.
>     objRefineQuery.setQuery(booleanQuery);
>     hits = searcher.search(booleanQuery);
> }
>
> import org.apache.lucene.search.Query;
>
> // Holds the most recent query so the next search can refine it.
> public class RefineQuery {
>
>     private Query query = null;
>
>     public Query getQuery() {
>         return query;
>     }
>
>     public void setQuery(Query query) {
>         this.query = query;
>     }
>
>     public String toString() {
>         // String.valueOf avoids a NullPointerException before the first search.
>         return String.valueOf(query);
>     }
> }
>
> Regards,
> Ravi Kumar Jaladanki
>
> -----Original Message-----
> From: Chun Wei Ho [mailto:cwho.work@gmail.com]
> Sent: Monday, February 13, 2006 3:05 PM
> To: java-user@lucene.apache.org
> Subject: Suggesting refine searches with Lucene
>
> Hi,
>
> I am trying to suggest refine searches for my Lucene search. For
> example, if a search returned too many results, it would list a
> number of document title subsequences that occurred frequently in the
> results of the previous search, as possible candidates for refining
> the search.
>
> Does anyone know the right/any approach to implementing this in a
> Lucene-based search app?
>
> Thanks.
>
> CW

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
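
One possible starting point for the "refine search" suggestions asked about in the quoted thread is to count how many hit headlines each word appears in and offer the most common words as refinements. The sketch below is only an illustration under stated assumptions: it does not use Lucene at all and assumes the headlines of the hits have already been fetched as plain strings. The class RefineSuggester and its suggest method are hypothetical names, not part of any library, and the tokenization (lowercase, split on non-alphanumerics, drop tokens shorter than 3 characters) is a deliberate simplification. It is written in current Java syntax.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

/** Hypothetical helper (not part of Lucene): proposes refinement terms from hit headlines. */
final class RefineSuggester {

    /**
     * Returns up to maxSuggestions terms, ordered by the number of headlines
     * that contain them. The original query term and tokens shorter than
     * 3 characters are skipped (an assumed, very crude stop-word filter).
     */
    static List<String> suggest(List<String> headlines, String queryTerm, int maxSuggestions) {
        String query = queryTerm.toLowerCase(Locale.ROOT);
        Map<String, Integer> docFreq = new HashMap<>();

        for (String headline : headlines) {
            // Count each term at most once per headline (document frequency).
            Set<String> seen = new HashSet<>();
            for (String token : headline.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
                if (token.length() < 3 || token.equals(query)) {
                    continue;
                }
                if (seen.add(token)) {
                    docFreq.merge(token, 1, Integer::sum);
                }
            }
        }

        List<Map.Entry<String, Integer>> entries = new ArrayList<>(docFreq.entrySet());
        // Most frequent first; ties are broken alphabetically so the order is deterministic.
        entries.sort((a, b) -> {
            int byCount = b.getValue().compareTo(a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });

        List<String> result = new ArrayList<>();
        for (Map.Entry<String, Integer> entry : entries) {
            if (result.size() == maxSuggestions) {
                break;
            }
            result.add(entry.getKey());
        }
        return result;
    }
}
```

Each suggested term could then be combined with the original query, for example via the mergeBooleanQueries approach quoted above, to run the refined search. A real implementation would probably want a proper analyzer and stop-word list rather than the length filter assumed here.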