lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Dave Kor <dave...@gmail.com>
Subject Re: Related searches
Date Wed, 01 Feb 2006 09:38:52 GMT
On 1/30/06, Leon Chaddock <leonchaddock@macranet.co.uk> wrote:
> Hi,
> Does anyone know if it is possible to show related searches with lucene, for example
if
> someone searched for "car insurance" you could bring back the results and related
> searches like these

One possible way is to use the vector space model on the set of
relevant documents returned by each query.

For example,
Relevant documents for the query "car insurance" are docids 1, 2, 4, 9, 10.
Relevant documents for the query "automobile insurance" are docids 2,
4, 8, 9, 10.
Relevant documents for the query "life insurance" are docids 3, 5, 7, 9.

Here, "automobile insurance" will be scored as more similar to "car
insurance" than "life insurance" because there is a larger set of
overlapping docids.

Lucene can be adapted for this purpose by creating a second index that
stores all unique queries and their set of relevant docids as Lucene
Documents. Instead of indexing text terms, we index docids. Finding
queries similiar to the original query, Q, is a simple matter of
querying this second index with the set of docids relevent to query Q.

Hope this helps.


--
Dave Kor, Research Assistant
Center for Information Mining and Extraction
School of Computing
National University of Singapore.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message