manifoldcf-dev mailing list archives

From "Karl Wright (JIRA)" <>
Subject [jira] [Commented] (CONNECTORS-1066) Don't send empty fields to Solr
Date Wed, 08 Oct 2014 17:11:34 GMT


Karl Wright commented on CONNECTORS-1066:

bq. It is wrong to put blank to RepositoryDocument in repository connector.

I am not sure I understand this.  Perhaps you can clarify?

My thinking is this: in a job, we do not know the full details of either the repository
connector or the output connector.  We do not know whether the repository can include blank
values, and we do not know whether the output index treats blank values the same as null
values.  Therefore, the *framework* must not make any judgements here.  Still, it is reasonable
to have an *option* to remove blank metadata values from a document because, as Alessandro
points out, most of the time these just confuse people.  I believe that this should not
happen silently, but since it is common, it makes sense for it to happen *easily*, without
a lot of configuration work.

My suggestion to add this functionality to the Metadata Adjuster transformer would achieve
this goal, especially if the transformer's default configuration were to remove blanks.
There would have to be a checkbox that you could uncheck to leave blanks in the metadata.
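
To make the proposed behavior concrete, here is a minimal sketch of a blank-removal step that is on by default and can be switched off.  It is an illustration only, not the Metadata Adjuster's actual code: the class name BlankFieldFilter, the method adjust, and the removeBlanks flag are hypothetical, and a plain Map stands in for the document's metadata rather than the real RepositoryDocument API.  It also assumes that "blank" means null, empty, or whitespace-only.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration; none of these names come from ManifoldCF.
class BlankFieldFilter {

    /**
     * Returns a copy of the metadata. When removeBlanks is true, values that
     * are null, empty, or whitespace-only are dropped, and any field left
     * with no values is dropped as well. When removeBlanks is false, the
     * metadata is copied through unchanged.
     */
    static Map<String, List<String>> adjust(Map<String, List<String>> metadata,
                                            boolean removeBlanks) {
        Map<String, List<String>> result = new LinkedHashMap<>();
        for (Map.Entry<String, List<String>> field : metadata.entrySet()) {
            List<String> kept = new ArrayList<>();
            for (String value : field.getValue()) {
                boolean blank = (value == null) || value.trim().isEmpty();
                if (!removeBlanks || !blank) {
                    kept.add(value);
                }
            }
            // With removal enabled, a field whose values were all blank
            // is omitted entirely instead of being sent with no values.
            if (!removeBlanks || !kept.isEmpty()) {
                result.put(field.getKey(), kept);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, List<String>> metadata = new LinkedHashMap<>();
        metadata.put("title", Arrays.asList("Report", ""));
        metadata.put("author", Arrays.asList("  "));

        // "Checkbox" checked (the proposed default): blanks are removed.
        System.out.println(adjust(metadata, true));
        // "Checkbox" unchecked: blanks are passed through untouched.
        System.out.println(adjust(metadata, false));
    }
}
```

Keeping the decision in a single flag on one pipeline stage mirrors the point above: neither the repository connector nor the output connector has to guess how the other treats blanks, and the removal is visible in the job configuration instead of happening silently.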

> Don't send empty fields to Solr
> -------------------------------
>                 Key: CONNECTORS-1066
>                 URL:
>             Project: ManifoldCF
>          Issue Type: Improvement
>          Components: Lucene/SOLR connector
>    Affects Versions: Manifold 1.7.1
>            Reporter: Karl Wright
>            Assignee: Karl Wright
>              Labels: solr
>             Fix For: ManifoldCF 2.0
>         Attachments: CONNECTORS-1066.patch
> Don't transmit empty fields to Solr.

