lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject Re: Boosting for most recent documents
Date Thu, 09 Jul 2009 21:21:13 GMT

Ah, with multiple indices you can't rely on the max Lucene doc Id.  I think you have to do
with the timestamp approach.

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----
> From: vivek sar <vivextra@gmail.com>
> To: solr-user@lucene.apache.org
> Sent: Thursday, July 9, 2009 1:13:54 PM
> Subject: Re: Boosting for most recent documents
> 
> Thanks Otis. I got a distributed index - using Solr multi-core.
> Basically, I got 6 indexer instances running on 3 different boxes.
> Couple of questions,
> 
> 1)  Is it possible to sort on document id for multiple-shards? How is that done?
> 2) How would boost by most recent doc at index time?
> 
> Thanks,
> -vivek
> 
> 
> 
> On Wed, Jul 8, 2009 at 7:47 PM, Otis
> Gospodneticwrote:
> >
> > Sort by the internal Lucene document ID and pick the highest one.  That might 
> do the job for you.
> >
> > Otis
> > --
> > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
> >
> >
> >
> > ----- Original Message ----
> >> From: vivek sar 
> >> To: solr-user 
> >> Sent: Wednesday, July 8, 2009 8:34:16 PM
> >> Subject: Boosting for most recent documents
> >>
> >> Hi,
> >>
> >>   I'm trying to find a way to get the most recent entry for the
> >> searched word. For ex., if I have a document with field name "user".
> >> If I search for user:vivek, I want to get the document that was
> >> indexed most recently. Two ways I could think of,
> >>
> >> 1) Sort by some time stamp field - but with millions of documents this
> >> becomes a huge memory problem as we have seen OOM with sorting before
> >> 2) Boost the most recent document - I'm not sure how to do this.
> >> Basically, we want to have the most recent document score higher than
> >> any other and then we can retrieve just 10 records and sort in the
> >> application by time stamp field to get the most recent document
> >> matching the keyword.
> >>
> >> Any suggestion on how can this be done?
> >>
> >> Thanks,
> >> -vivek
> >
> >


Mime
View raw message