lucene-dev mailing list archives

From "Robert Muir (JIRA)" <>
Subject [jira] Commented: (LUCENE-1813) Add option to ReverseStringFilter to mark reversed tokens
Date Mon, 17 Aug 2009 00:20:14 GMT


Robert Muir commented on LUCENE-1813:

What if we drop the boolean option for a marker character and instead offer two
constructors, ReverseFilter() and ReverseFilter(char marker)?
This way, Lucene does not define the character used for this operation, and users are
free to select whichever one they want (such as U+0001).
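
To make the two-variant idea concrete, here is a minimal standalone sketch. It is not the attached patch and does not use the Lucene TokenFilter API; the class name MarkedReverser and its static methods are hypothetical, and placing the marker at the front of the reversed token is an assumption made for illustration.

```java
// Hypothetical helper, not part of Lucene: mirrors the proposed
// "no marker" and "caller-supplied marker" variants as two overloads.
public class MarkedReverser {

    /** Reverses the token char by char, without any marker. */
    public static String reverse(String token) {
        char[] chars = token.toCharArray();
        // Swap from both ends toward the middle.
        for (int i = 0, j = chars.length - 1; i < j; i++, j--) {
            char tmp = chars[i];
            chars[i] = chars[j];
            chars[j] = tmp;
        }
        return new String(chars);
    }

    /**
     * Reverses the token and puts the caller-chosen marker in front of it.
     * Assumption: the marker leads the reversed form so that reversed
     * tokens can be told apart from straight ones in the same field.
     */
    public static String reverse(String token, char marker) {
        return marker + reverse(token);
    }

    public static void main(String[] args) {
        System.out.println(reverse("lucene"));
        // The caller, not the library, picks the marker (here U+0001).
        String marked = reverse("lucene", '\u0001');
        System.out.println(marked.charAt(0) == '\u0001');
    }
}
```

Because this version swaps individual chars, it would split surrogate pairs, which is the limitation raised in the next paragraph.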

When we are on Java 5 and can support supplementary characters properly (reversing,
wildcards, etc.), we could change this to ReverseFilter(int marker) so that someone can use
any code point they want, including ones outside of the BMP.
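
As a rough sketch of what the later int-marker variant could look like, the code below reverses by code point using only Java 5 String and Character methods. The class name CodePointReverser is hypothetical, and the leading marker position is again an assumption, not something the patch confirms.

```java
// Hypothetical helper, not part of Lucene: reverses by code point so that
// surrogate pairs stay intact, and accepts any code point as the marker.
public class CodePointReverser {

    /** Reverses the token code point by code point, led by the marker. */
    public static String reverse(String token, int marker) {
        StringBuilder out = new StringBuilder(token.length() + 2);
        out.appendCodePoint(marker);
        // Walk backwards; codePointBefore joins a surrogate pair into one value.
        for (int i = token.length(); i > 0; ) {
            int cp = token.codePointBefore(i);
            out.appendCodePoint(cp);
            i -= Character.charCount(cp);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // "a", U+1D11E (outside the BMP), "b"
        String token = "a\uD834\uDD1Eb";
        String reversed = reverse(token, 0x10000);
        System.out.println(reversed.codePointAt(0) == 0x10000);
        System.out.println(reversed.substring(2).equals("b\uD834\uDD1Ea"));
    }
}
```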

If this is ok, I will supply a patch.

> Add option to ReverseStringFilter to mark reversed tokens
> ---------------------------------------------------------
>                 Key: LUCENE-1813
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/analyzers
>    Affects Versions: 2.9
>            Reporter: Andrzej Bialecki 
>            Assignee: Robert Muir
>             Fix For: 2.9
>         Attachments: reverseMark-2.patch, reverseMark.patch
> This patch implements additional functionality in the filter to "mark" reversed tokens
> with a special marker character (Unicode 0001). This is useful when indexing both straight
> and reversed tokens (e.g. to implement efficient leading wildcards search).

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
