lucene-solr-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andrzej Bialecki (JIRA)" <j...@apache.org>
Subject [jira] Created: (SOLR-1536) Support for TokenFilters that may modify input documents
Date Fri, 30 Oct 2009 18:43:59 GMT
Support for TokenFilters that may modify input documents
--------------------------------------------------------

                 Key: SOLR-1536
                 URL: https://issues.apache.org/jira/browse/SOLR-1536
             Project: Solr
          Issue Type: New Feature
          Components: Analysis
    Affects Versions: 1.5
            Reporter: Andrzej Bialecki 
         Attachments: altering.patch

In some scenarios it's useful to be able to create or modify fields in the input document
based on analysis of other fields of this document. This need arises e.g. when indexing multilingual
documents, or when doing NLP processing such as NER. However, currently this is not possible
to do.

This issue provides an implementation of this functionality that consists of the following
parts:

* DocumentAlteringFilterFactory - abstract superclass that indicates that TokenFilter-s created
from this factory may modify fields in a SolrInputDocument.

* TypeAsFieldFilterFactory - example implementation that illustrates this concept, with a
JUnit test.

* DocumentBuilder modifications to support this functionality.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message