lucene-solr-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Update of "TextProfileSignature" by EustacheFelenc
Date Wed, 20 Feb 2013 22:00:43 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "TextProfileSignature" page has been changed by EustacheFelenc:
http://wiki.apache.org/solr/TextProfileSignature?action=diff&rev1=4&rev2=5

  
  TextProfileSignature operates on raw text, without the filtering provided by Analyzers.
It therefore does not strip HTML, normalize diacritics, stem words or account for semantics,
or weight tokens by their relative importance. It also treats the text as a bag of words,
ignoring word order.
  
+ == Configuration ==
+ 
+ === solrconfig.xml ===
+ 
+ Example settings:
+ {{{
+   <!-- An example dedup update processor that creates the "id" field on the fly
+        based on the hash code of some other fields.  This example has overwriteDupes
+        set to false since we are using the id field as the signatureField and Solr
+        will maintain uniqueness based on that anyway. -->
+   <updateRequestProcessorChain name="dedupe">
+     <processor class="org.apache.solr.update.processor.SignatureUpdateProcessorFactory">
+       <bool name="enabled">true</bool>
+       <bool name="overwriteDupes">false</bool>
+       <str name="signatureField">id</str>
+       <str name="fields">name,features,cat</str>
+       <str name="signatureClass">org.apache.solr.update.processor.TextProfileSignature</str>
+       <str name="quantRate">.2</str>
+     </processor>
+     <processor class="solr.LogUpdateProcessorFactory" />
+     <processor class="solr.RunUpdateProcessorFactory" />
+   </updateRequestProcessorChain>
+ }}}
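+ 
+ The chain above only takes effect when an update request selects it. A minimal sketch of making "dedupe" the default chain for the /update handler, assuming Solr 4.x naming: the update.chain parameter and the solr.UpdateRequestHandler class are assumptions that do not appear in the settings above, so adjust them to your version.
+ {{{
+   <!-- Sketch only: route updates sent to /update through the "dedupe" chain.
+        Assumes Solr 4.x names (solr.UpdateRequestHandler, update.chain). -->
+   <requestHandler name="/update" class="solr.UpdateRequestHandler">
+     <lst name="defaults">
+       <str name="update.chain">dedupe</str>
+     </lst>
+   </requestHandler>
+ }}}
+ 
+ Under the same assumptions, the chain can instead be chosen per request by passing update.chain=dedupe as a request parameter, which leaves other update traffic unaffected.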
+ 
