lucene-solr-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ryan McKinley (JIRA)" <>
Subject [jira] Updated: (SOLR-69) PATCH:MoreLikeThis support
Date Wed, 30 May 2007 23:22:15 GMT


Ryan McKinley updated SOLR-69:

    Attachment: SOLR-69-MoreLikeThisRequestHandler.patch

Updated patch to:
* use searcher.getSchema().getAnalyzer()
* be able to find similar documents from posted text
* be able to return the "interesting terms" used for the MLT query

Andrew: about field boosting...  This handler uses the lucene contrib MoreLikeThis implementation
-- that does not have a way to boost one field above another, If it did, we could easily add
it ;)

> PATCH:MoreLikeThis support
> --------------------------
>                 Key: SOLR-69
>                 URL:
>             Project: Solr
>          Issue Type: Improvement
>          Components: search
>            Reporter: Bertrand Delacretaz
>            Priority: Minor
>         Attachments: lucene-queries-2.0.0.jar, lucene-queries-2.1.1-dev.jar, SOLR-69-MoreLikeThisRequestHandler.patch,
SOLR-69-MoreLikeThisRequestHandler.patch, SOLR-69-MoreLikeThisRequestHandler.patch, SOLR-69-MoreLikeThisRequestHandler.patch,
SOLR-69.patch, SOLR-69.patch, SOLR-69.patch, SOLR-69.patch
> Here's a patch that implements simple support of Lucene's MoreLikeThis class.
> The MoreLikeThisHelper code is heavily based on (hmm..."lifted from" might be more appropriate
;-) Erik Hatcher's example mentioned in
> To use it, add at least the following parameters to a standard or dismax query:
>   mlt=true
>   mlt.fl=list,of,fields,which,define,similarity
> See the MoreLikeThisHelper source code for more parameters.
> Here are two URLs that work with the example config, after loading all documents found
in exampledocs in the index (just to show that it seems to work - of course you need a larger
corpus to make it interesting):
> http://localhost:8983/solr/select/?stylesheet=&q=apache&qt=standard&mlt=true&mlt.fl=manu,cat&mlt.mindf=1&mlt.mindf=1&fl=id,score
> http://localhost:8983/solr/select/?stylesheet=&q=apache&qt=dismax&mlt=true&mlt.fl=manu,cat&mlt.mindf=1&mlt.mindf=1&fl=id,score
> Results are added to the output like this:
> <response>
>   ...
>   <lst name="moreLikeThis">
>     <result name="UTF8TEST" numFound="1" start="0" maxScore="1.5293242">
>       <doc>
>         <float name="score">1.5293242</float>
>         <str name="id">SOLR1000</str>
>       </doc>
>     </result>
>     <result name="SOLR1000" numFound="1" start="0" maxScore="1.5293242">
>       <doc>
>         <float name="score">1.5293242</float>
>         <str name="id">UTF8TEST</str>
>       </doc>
>     </result>
>   </lst>
> I haven't tested this extensively yet, will do in the next few days. But comments are
welcome of course.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message