lucene-solr-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mike Klaas (JIRA)" <j...@apache.org>
Subject [jira] Commented: (SOLR-14) Add the ability to preserve the original term when using WordDelimiterFilter
Date Mon, 16 Jun 2008 20:31:45 GMT

    [ https://issues.apache.org/jira/browse/SOLR-14?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12605403#action_12605403
] 

Mike Klaas commented on SOLR-14:
--------------------------------

Note that it is very easy to use an external TokenFilter, so you could just c&p WDF into
your own class and make the changes.

(Though I'm not saying that this _shouldn't_ make it in for 1.3)

> Add the ability to preserve the original term when using WordDelimiterFilter
> ----------------------------------------------------------------------------
>
>                 Key: SOLR-14
>                 URL: https://issues.apache.org/jira/browse/SOLR-14
>             Project: Solr
>          Issue Type: Improvement
>          Components: search
>            Reporter: Richard "Trey" Hyde
>         Attachments: TokenizerFactory.java, WordDelimiterFilter.patch, WordDelimiterFilter.patch
>
>
> When doing prefix searching, you need to hang on to the original term othewise you'll
miss many matches you should be making.
> Data: ABC-12345
> WordDelimiterFitler may change this into
> ABC 12345 ABC12345
> A user may enter a search such as 
>  ABC\-123*
> Which will fail to find a match given the above scenario.
> The attached patch will allow the use of the "preserveOriginal" option to WordDelimiterFilter
and will analyse as
> ABC 12345 ABC12345  ABC-12345 
> in which case we will get a postive match.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message