lucene-solr-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yonik Seeley (JIRA)" <j...@apache.org>
Subject [jira] Commented: (SOLR-248) Capitalization Filter Factory
Date Thu, 24 May 2007 17:08:17 GMT

    [ https://issues.apache.org/jira/browse/SOLR-248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12498717
] 

Yonik Seeley commented on SOLR-248:
-----------------------------------

> Implemented at the indexing level, I can have different values for the stored value and
indexed terms.
One downside is that it complicates certain things like wildcard or prefix queries (capitalizing
the first letter and lowercasing the second is something that the QueryParser does not support).

You could still store the values verbatim, and index as all lowercase.
Then the application could capitalize the results it gets back as it sees fit.
I do see value pushing this type of logic back to the search engine though.

Of course, I think this might be a more general problem in faceting... what to actually use
as a label for display purposes vs what the terms in the index were (think price formatting,
labels for more complex facet queries, etc).


> Capitalization Filter Factory
> -----------------------------
>
>                 Key: SOLR-248
>                 URL: https://issues.apache.org/jira/browse/SOLR-248
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-248-CapitalizationFilter.patch
>
>
> For tokens that are used in faceting, it is nice to have standard capitalization.  
> I want "Aerial views" and "Aerial Views" to both be: "Aerial Views"

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message