lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael McCandless (JIRA)" <j...@apache.org>
Subject [jira] Updated: (LUCENE-1441) KeywordTokenizer does not set start/end offset of the Token it produces
Date Fri, 07 Nov 2008 19:15:44 GMT

     [ https://issues.apache.org/jira/browse/LUCENE-1441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Michael McCandless updated LUCENE-1441:
---------------------------------------

    Attachment: LUCENE-1441.patch

Attached patch.  I plan to commit shortly.

> KeywordTokenizer does not set start/end offset of the Token it produces
> -----------------------------------------------------------------------
>
>                 Key: LUCENE-1441
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1441
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: Analysis
>    Affects Versions: 2.3, 2.3.1, 2.3.2, 2.4
>            Reporter: Michael McCandless
>            Assignee: Michael McCandless
>            Priority: Minor
>         Attachments: LUCENE-1441.patch
>
>
> I think just adding these two lines in the next(Token) method is the right fix:
>            reusableToken.setStartOffset(0);
>            reusableToken.setEndOffset(upto);
> I don't think this is a back compat issue because the start/end offset are now meaningless
since they will inherit whatever the reusable token had previously been used for.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message