manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Karl Wright (JIRA)" <>
Subject [jira] [Commented] (CONNECTORS-1234) TikaExtractor based indexing on Elasticsearch connector
Date Thu, 27 Aug 2015 21:18:45 GMT


Karl Wright commented on CONNECTORS-1234:

Actually, now I remember.

For the Solr connector, we have to limit document sizes when it is constructing a document
in memory, so there is a specific limit for that purpose.  So that has to remain there.

Does ElasticSearch have a similar problem?

> TikaExtractor based indexing on Elasticsearch connector
> -------------------------------------------------------
>                 Key: CONNECTORS-1234
>                 URL:
>             Project: ManifoldCF
>          Issue Type: Improvement
>            Reporter: Shinichiro Abe
>            Assignee: Shinichiro Abe
>         Attachments: CONNECTORS-1234.patch
> We could add the use-mapper-attachments flag.
> Default to true, current spec which asks for mapper-attachments plugin on ES side.
> If false, it allows us to index the content and metadata that extracted from files through
Tika transformer, which means there is no need to install that plugin and put base64 encoded

This message was sent by Atlassian JIRA

View raw message