lucene-solr-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Update of "AnalyzersTokenizersTokenFilters" by israelekpo
Date Sat, 21 Aug 2010 23:32:19 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "AnalyzersTokenizersTokenFilters" page has been changed by israelekpo.
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters?action=diff&rev1=85&rev2=86

--------------------------------------------------

    * Replaces numeric character entities references like {{{&#65;}}} or {{{&#x7f;}}}
     * The terminating ';' is optional if the entity reference is followed by whitespace.
    * Replaces all [[http://www.w3.org/TR/REC-html40/sgml/entities.html|named character entity
references]].
-    *   is replaced with a space instead of 0xa0
+    * is replaced with a space instead of 0xa0
     * terminating ';' is mandatory to avoid false matches on something like "Alpha&Omega
Corp"
  
  HTML stripping examples:
  ||my <a href="www.foo.bar">link</a> ||my link ||
- ||<?xml?><br>hello<!--comment--> ||hello ||
+ ||<br>hello<!--comment--> ||hello ||
  ||hello<script><-- f('<--internal--></script>'); --></script>
||hello ||
  ||if a<b then print a; ||if a<b then print a; ||
  ||hello <td height=22 nowrap align="left"> ||hello ||
@@ -263, +263 @@

     </analyzer>
  </fieldtype>
  }}}
- <<Anchor('''EdgeNGramFilter''')>>
+ <<Anchor(EdgeNGramFilter)>>
  
  '''solr.EdgeNGramFilterFactory'''
  

Mime
View raw message