lucene-dev mailing list archives

From "Michael McCandless (JIRA)" <>
Subject [jira] [Updated] (LUCENE-4724) TaxonomyReader drops empty string component from CategoryPath
Date Sun, 27 Jan 2013 12:17:17 GMT


Michael McCandless updated LUCENE-4724:

    Attachment: LUCENE-4724.patch

New patch w/ fix.

The problem was String.split: if you end with a delimiter, or have multiple delimiters in
a row, then you lose the empty strings ...
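
For readers following along, here is a minimal sketch of the String.split behavior described above. It is not the code from the attached patch: the class and method names are hypothetical, and the use of '/' as the delimiter and of a negative limit as the remedy are assumptions made for illustration. With the default limit, java.lang.String.split discards trailing empty strings, so the serialized form of ("categories", "") comes back with a single component.

```java
import java.util.Arrays;
import java.util.regex.Pattern;

// Hypothetical helper, not Lucene code: shows how String.split treats
// empty components at the end of a delimited path string.
class PathSplitDemo {

    // Default split (limit 0): trailing empty strings are discarded.
    static String[] splitDroppingTrailing(String path, char delimiter) {
        return path.split(Pattern.quote(String.valueOf(delimiter)));
    }

    // Negative limit: every component is kept, including trailing empty ones.
    static String[] splitKeepingEmpties(String path, char delimiter) {
        return path.split(Pattern.quote(String.valueOf(delimiter)), -1);
    }

    public static void main(String[] args) {
        // Assumed serialized form of the two-component path ("categories", "").
        String serialized = String.join("/", "categories", "");

        // The empty second component is lost here.
        System.out.println(Arrays.toString(splitDroppingTrailing(serialized, '/')));

        // Both components survive here.
        System.out.println(Arrays.toString(splitKeepingEmpties(serialized, '/')));
    }
}
```

Whether the actual fix passes a negative limit or parses the string by hand is not stated in this message; the sketch only shows why a round trip through the default split loses the empty component.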
> TaxonomyReader drops empty string component from CategoryPath
> -------------------------------------------------------------
>                 Key: LUCENE-4724
>                 URL:
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: modules/facet
>            Reporter: Michael McCandless
>             Fix For: 4.2, 5.0
>         Attachments: LUCENE-4724.patch, LUCENE-4724.patch
> I ran the new PrintTaxonomyStats on a Wikipedia facets index, and it hit an AIOOBE because
> there was a child of the /categories path that had only one component ... this was created
> because I had added new CategoryPath("categories", "") during indexing.
> I think TaxoReader should preserve and return that empty string from .getPath?

