lucene-solr-user mailing list archives

From Max Lynch <ihas...@gmail.com>
Subject Search a URL
Date Thu, 23 Sep 2010 20:59:23 GMT
Is there a tokenizer that will allow me to search for parts of a URL?  For
example, a search for "google" should match the data
"http://mail.google.com/dlkjadf".

This field type (whitespace tokenizer plus word delimiter filter) doesn't
seem to be sufficient:

    <fieldType name="text_standard" class="solr.TextField"
               positionIncrementGap="100">
        <analyzer type="index">
            <tokenizer class="solr.WhitespaceTokenizerFactory"/>
            <filter class="solr.WordDelimiterFilterFactory"
                    generateWordParts="0" generateNumberParts="1"
                    catenateWords="1" catenateNumbers="1" catenateAll="0"
                    splitOnCaseChange="1"/>
            <filter class="solr.LowerCaseFilterFactory"/>
            <filter class="solr.SnowballPorterFilterFactory"
                    language="English" protected="protwords.txt"/>
        </analyzer>
        <analyzer type="query">
            <tokenizer class="solr.WhitespaceTokenizerFactory"/>
            <filter class="solr.WordDelimiterFilterFactory"
                    generateWordParts="0" generateNumberParts="1"
                    catenateWords="1" catenateNumbers="1" catenateAll="0"
                    splitOnCaseChange="1"/>
            <filter class="solr.LowerCaseFilterFactory"/>
            <filter class="solr.SnowballPorterFilterFactory"
                    language="English" protected="protwords.txt"/>
        </analyzer>
    </fieldType>
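
One variant that may be worth trying, as an untested sketch: with
generateWordParts="0" and catenateWords="1", the word delimiter filter
should emit only the joined form of the URL's parts rather than "google"
on its own, which would explain the miss.  Turning generateWordParts on
should make each part (http, mail, google, com, dlkjadf) its own token.
The field type name "text_url" is a hypothetical name chosen for
illustration, and the stemmer is left out here on the assumption that URL
parts should not be stemmed:

    <!-- Sketch only: "text_url" is a hypothetical name, not from the
         original schema. -->
    <fieldType name="text_url" class="solr.TextField"
               positionIncrementGap="100">
        <!-- A single analyzer is applied at both index and query time. -->
        <analyzer>
            <tokenizer class="solr.WhitespaceTokenizerFactory"/>
            <!-- generateWordParts="1" emits each delimited word part as
                 its own token instead of only the catenated form. -->
            <filter class="solr.WordDelimiterFilterFactory"
                    generateWordParts="1" generateNumberParts="1"
                    catenateWords="0" catenateNumbers="0" catenateAll="0"
                    splitOnCaseChange="1"/>
            <filter class="solr.LowerCaseFilterFactory"/>
        </analyzer>
    </fieldType>

The analysis page in the Solr admin interface should show which tokens
each configuration actually produces for a sample URL.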

Thanks.
