lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ian Lea <ian....@gmail.com>
Subject Re: lucene/solr full text search
Date Fri, 30 Jul 2010 20:35:22 GMT
Yes, you can do that.  Make a Query for the 30 papers and use that
with your main query in a BooleanQuery if doing it programatically.
Or with so few documents and papers to match, just in a long string
via QueryParser.  See
http://lucene.apache.org/java/3_0_2/queryparsersyntax.html for details
on query parser syntax - that is also the answer to your other
question.

If performance is a concern and you will be reusing the same set of
pubmed ids you should look at filters, specifically QueryWrapperFilter
and CachingWrapperFilter.


--
Ian.


On Fri, Jul 30, 2010 at 6:19 PM, Shuai Weng <shuai@genome.stanford.edu> wrote:
>
> Sorry for the confusion..
>
> Currently, we have total 7000 fulltext papers (with the pubmed IDs stored as the unique
IDs)
> in the lucene index.  We were wondering if we can search for a given term in a subset
of these papers
> (eg, 30 papers; by providing a list of the pubmed IDs) instead of searching the term
in these 7000
> papers. In another word, we only care about the hits in the 30 given papers. How can
we easily filter out
> the other papers?
>
> Thanks,
> Shuai
>
>
> On Jul 30, 2010, at 9:56 AM, Ian Lea wrote:
>
>> Depending on what exactly you mean by "subset" and "index pool", then yes.
>>
>> If you've got one lucene index containing docs
>>
>> docno: 1
>> category: computers
>> text: some words about computers
>>
>> docno: 2
>> category: computers
>> text: some more words about computers
>>
>> docno: 3
>> category: finance
>> text: some words about finance
>>
>> then a search for "text:words" will match all 3 whereas a search for
>> "category:computers text:words" will only match 2.
>>
>>
>> If this isn't what you are asking about I suggest you provide more detail.
>>
>>
>> --
>> Ian.
>>
>> On Fri, Jul 30, 2010 at 4:15 PM, Shuai Weng <shuai@genome.stanford.edu> wrote:
>>>
>>> Hey,
>>>
>>> I was wondering if we can search info from a subset of papers
>>> instead of from the whole index pool.
>>>
>>> Thanks,
>>> Shuai
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message