lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael McCandless (JIRA)" <>
Subject [jira] [Commented] (LUCENE-4658) Per-segment tracking of external/side-car data
Date Sun, 17 Mar 2013 15:23:16 GMT


Michael McCandless commented on LUCENE-4658:

bq. As I understand it, Lucene's faceting module uses a side-car index. If so, then if the
feature proposed here is a good API then the faceting module will use it. No?

It does use a side-car (taxonomy) index, so that facet labels use global ords, which makes
counting/NRT reopen fast.

But, that index is global, vs this patch which adds a per-segment side-car, so it wouldn't
quite fit, until/unless we change taxonomy writer/reader to work per-segment.
> Per-segment tracking of external/side-car data
> ----------------------------------------------
>                 Key: LUCENE-4658
>                 URL:
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Michael McCandless
>            Assignee: Michael McCandless
>         Attachments: LUCENE-4658.patch, LUCENE-4658.patch
> Spinoff from David's idea on LUCENE-4258
> (
> I made a prototype patch that allows custom per-segment "side-car
> data".  It adds an abstract ExternalSegmentData class.  The idea is
> the app implements this, and IndexWriter will pass each Document
> through to it, and call on it to do flushing/merging.  I added a
> setter to IndexWriterConfig to enable it, but I think this would
> really belong in Codec ...
> I haven't tackled the read-side yet, though this is already usable
> without that (ie, the app can just open its own files, read them,
> etc.).
> The random test case passes.
> I think for example this might make it easier for Solr/ElasticSearch
> to implement things like ExternalFileField.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message