lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Steven Rowe (JIRA)" <>
Subject [jira] Updated: (LUCENE-1435) CollationKeyFilter: convert tokens into CollationKeys encoded using IndexableBinaryStringTools
Date Tue, 11 Nov 2008 05:03:44 GMT


Steven Rowe updated LUCENE-1435:

    Attachment: LUCENE-1435.patch

Modifications in this patch:

# Added dependency on ICU4J 4.0
# Introduced ICUCollationKeyFilter, which uses ICU collation to produce the collation keys
# Added Analyzer versions of the Filters, creating IndexableBinaryStringTools-encoded collation
keys from the single token produced by KeywordTokenizer.
# Centralized testing to a base class, which the four test classes extend, to avoid duplication
# Moved from contrib/analyzers/o/a/l/analysis/miscellaneous/ to a new contrib package: contrib/collation,
because it doesn't make sense to add a dependency to the entire contrib/analyzers package
just for ICUCollationKeyFilter/Analyzer

The external ICU4J dependency, which should be checked into contrib/collation/lib/, can be
downloaded here: [].  The license
for this jar is included in the patch at contrib/collation/lib/ICU-LICENSE.txt.

> CollationKeyFilter: convert tokens into CollationKeys encoded using IndexableBinaryStringTools
> ----------------------------------------------------------------------------------------------
>                 Key: LUCENE-1435
>                 URL:
>             Project: Lucene - Java
>          Issue Type: New Feature
>    Affects Versions: 2.4
>            Reporter: Steven Rowe
>            Priority: Minor
>             Fix For: 2.9
>         Attachments: LUCENE-1435.patch, LUCENE-1435.patch
> Converts each token into its CollationKey using the provided collator, and then encodes
the CollationKey with IndexableBinaryStringTools, to allow it to be stored as an index term.
> This will allow for efficient range searches and Sorts over fields that need collation
for proper ordering.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message