lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Steven Rowe (JIRA)" <j...@apache.org>
Subject [jira] Updated: (LUCENE-1435) CollationKeyFilter: convert tokens into CollationKeys encoded using IndexableBinaryStringTools
Date Tue, 11 Nov 2008 05:03:44 GMT

     [ https://issues.apache.org/jira/browse/LUCENE-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Steven Rowe updated LUCENE-1435:
--------------------------------

    Attachment: LUCENE-1435.patch

Modifications in this patch:

# Added dependency on ICU4J 4.0
# Introduced ICUCollationKeyFilter, which uses ICU collation to produce the collation keys
# Added Analyzer versions of the Filters, creating IndexableBinaryStringTools-encoded collation
keys from the single token produced by KeywordTokenizer.
# Centralized testing to a base class, which the four test classes extend, to avoid duplication
# Moved from contrib/analyzers/o/a/l/analysis/miscellaneous/ to a new contrib package: contrib/collation,
because it doesn't make sense to add a dependency to the entire contrib/analyzers package
just for ICUCollationKeyFilter/Analyzer

The external ICU4J dependency, which should be checked into contrib/collation/lib/, can be
downloaded here: [http://download.icu-project.org/files/icu4j/4.0/icu4j-4_0.jar].  The license
for this jar is included in the patch at contrib/collation/lib/ICU-LICENSE.txt.


> CollationKeyFilter: convert tokens into CollationKeys encoded using IndexableBinaryStringTools
> ----------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-1435
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1435
>             Project: Lucene - Java
>          Issue Type: New Feature
>    Affects Versions: 2.4
>            Reporter: Steven Rowe
>            Priority: Minor
>             Fix For: 2.9
>
>         Attachments: LUCENE-1435.patch, LUCENE-1435.patch
>
>
> Converts each token into its CollationKey using the provided collator, and then encodes
the CollationKey with IndexableBinaryStringTools, to allow it to be stored as an index term.
> This will allow for efficient range searches and Sorts over fields that need collation
for proper ordering.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message