lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jim Murphy <>
Subject Re: Using Solr for Info Retreval not so much Search...
Date Wed, 30 Jul 2008 23:38:58 GMT

*Excellent* so a custom QueryComponent it is.

The Solr score doesn't factor in too much - our search needs are modest -
just does it contain the keyword (or variants, stems etc) or not.  So the
query trims down from ~100M to 10-10000.  That way the more expensive
filtering operates at the smaller set as you suggest.

I need to sort by one of my date fields or the external rank.  The first is
easy.  The second is difficult so I will have to query the external system
for all matching docs - but if its on the reduced set its manageable.

One Remaining Question:  I'd like to include my external threshold value int
he document.  Any ideas?  Can I stuff a float field somewhere on the docs? 



hossman wrote:
> : 1. Query the index for entries matching keyword.
> : 2. remove any entries that are below a threshold score from the external
> : system
> what do you need to sort by? .. if it's the threshold score from your 
> external system, you have no way of avoiding a call out to your external 
> system for every matching doc ... if you want to sort by the "Solr Score" 
> then it should be fairly easy to write a SearchComponent that gets a 
> DocList and walks them in order removing anything that doesn't meet the 
> threshold (re-executing the query with a higher number of rows if it 
> exhausts the current DocList) untill you've got enough to return to your 
> client.
> -Hoss

View this message in context:
Sent from the Solr - User mailing list archive at

View raw message