lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Shuai Weng <sh...@genome.stanford.edu>
Subject Re: lucene/solr full text search
Date Fri, 30 Jul 2010 20:39:45 GMT

I just tried the long query string as you suggested and it works great.

Thanks,
Shuai

On Jul 30, 2010, at 1:35 PM, Ian Lea wrote:

> Yes, you can do that.  Make a Query for the 30 papers and use that
> with your main query in a BooleanQuery if doing it programatically.
> Or with so few documents and papers to match, just in a long string
> via QueryParser.  See
> http://lucene.apache.org/java/3_0_2/queryparsersyntax.html for details
> on query parser syntax - that is also the answer to your other
> question.
> 
> If performance is a concern and you will be reusing the same set of
> pubmed ids you should look at filters, specifically QueryWrapperFilter
> and CachingWrapperFilter.
> 
> 
> --
> Ian.
> 
> 
> On Fri, Jul 30, 2010 at 6:19 PM, Shuai Weng <shuai@genome.stanford.edu> wrote:
>> 
>> Sorry for the confusion..
>> 
>> Currently, we have total 7000 fulltext papers (with the pubmed IDs stored as the
unique IDs)
>> in the lucene index.  We were wondering if we can search for a given term in a subset
of these papers
>> (eg, 30 papers; by providing a list of the pubmed IDs) instead of searching the term
in these 7000
>> papers. In another word, we only care about the hits in the 30 given papers. How
can we easily filter out
>> the other papers?
>> 
>> Thanks,
>> Shuai
>> 
>> 
>> On Jul 30, 2010, at 9:56 AM, Ian Lea wrote:
>> 
>>> Depending on what exactly you mean by "subset" and "index pool", then yes.
>>> 
>>> If you've got one lucene index containing docs
>>> 
>>> docno: 1
>>> category: computers
>>> text: some words about computers
>>> 
>>> docno: 2
>>> category: computers
>>> text: some more words about computers
>>> 
>>> docno: 3
>>> category: finance
>>> text: some words about finance
>>> 
>>> then a search for "text:words" will match all 3 whereas a search for
>>> "category:computers text:words" will only match 2.
>>> 
>>> 
>>> If this isn't what you are asking about I suggest you provide more detail.
>>> 
>>> 
>>> --
>>> Ian.
>>> 
>>> On Fri, Jul 30, 2010 at 4:15 PM, Shuai Weng <shuai@genome.stanford.edu>
wrote:
>>>> 
>>>> Hey,
>>>> 
>>>> I was wondering if we can search info from a subset of papers
>>>> instead of from the whole index pool.
>>>> 
>>>> Thanks,
>>>> Shuai
>>> 
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>>> For additional commands, e-mail: java-user-help@lucene.apache.org
>> 
>> 
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-user-help@lucene.apache.org
>> 
>> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message