lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sreekant Sreedharan <sreeka...@alamy.com>
Subject Understanding the MoreLikeThis Handler
Date Mon, 22 Jun 2015 15:14:03 GMT
 I am trying to implement a search for documents with similar texts like a
sample document. I assumed that the SOLR MoreLikeThis handler would suit me
well. 

I have been able to setup the handler and it seems to work. I have a core
called images2. And I have the following entry in my solrconfig.xml for this
particular core.

  <requestHandler name="/mlt" class="solr.MoreLikeThisHandler">
  </requestHandler>

This setup allows me to make the following query.
/solr/images2/mlt?q=id:86C544948369405D822FA6FBE5EBD49E&mlt.fl=contributor&mlt.match.include=true

I am trying to tell SOLR to seach for all documents that are similar to the
document with id=86C544948369405D822FA6FBE5EBD49E, and I want to match
similars on the field contributor.

This is the result I get.

<response>
<lst name="responseHeader">
   <int name="status">0</int><int name="QTime">0</int>
</lst>
<result name="match" numFound="1" start="0">
<doc>
    <str name="id">86C544948369405D822FA6FBE5EBD49E</str>
    <str name="ar">EEJ6PJ</str><str name="caption">Commuters in Subway
Car</str>
     ....
     <str name="contributorid">3FED3D3D18F84362A53C91700E1807BF</str>
     <str name="contributor">Flirt</str>

</doc>
</result>
<result name="response" numFound="0" start="0"/>
</response>


So here are my questions:
1. Is my query right for what I want to do? Namely: seach for all documents
that are similar to the document with id=86C544948369405D822FA6FBE5EBD49E,
and match similars on the field contributor. If now, how do I encode it?

2. I know that there are several documents with the exact term 'Flirt' in
the contributor field. But I seem to be always getting only one result. I
tested the following SOLR query:

http://localhost:8983/solr/images2/select?q=*:*&wt=xml&fq=contributor:Flirt

And this returns 1138 results. So why is the MLT not returning atleast these
results, which are identical matches?






--
View this message in context: http://lucene.472066.n3.nabble.com/Understanding-the-MoreLikeThis-Handler-tp4213279.html
Sent from the Solr - User mailing list archive at Nabble.com.

Mime
View raw message