lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Doron Cohen <DOR...@il.ibm.com>
Subject Re: Scoring for all the documents in the index relative to a query
Date Tue, 20 Nov 2007 12:23:28 GMT
You can also rely on that by default documents are
collected in-docid-order. You can therefore use your own
hit collector that when collecting doc with id n2,
assuming the previous doc collected had id n1,
would (know to) assign score 0 to all docs
with: n1 < id < n2.

In other words, you can know earlier that a certain
doc is a non-match, not having to wait until all
matching docs were collected. (well, at least I had
the impression that something like this might help you.)

Doron

HAIDUC SONIA wrote:
> I am trying to order all the documents in the index according
> to their similarity to a given query. I am interested in having
> a complete list of *all* the documents in the index with their
> score. From what I understood by reading some documentation,
> Lucene internally assigns scores to all the documents in the
> index according to their similarity to the query, but when
> returning the hits, all the scores that are less than 0 are
> rounded to 0 and only the documents with the score > 0 are
> returned as hits. But what I would like to get is the list
> before this intermediate processing, so the list of all the
> documents with their raw score. I am trying to compare Lucene
> with LSI and for the comparison I want to do, I need the entire
> list of documents. Is there a way that I can get that with Lucene?
> I hope I explained it clearly this time. If you need more
> details let me know.
>
> Thank you,
> Sonia


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message