Return-Path: X-Original-To: apmail-manifoldcf-dev-archive@www.apache.org Delivered-To: apmail-manifoldcf-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1F1FA1743B for ; Wed, 26 Aug 2015 07:23:46 +0000 (UTC) Received: (qmail 52153 invoked by uid 500); 26 Aug 2015 07:23:46 -0000 Delivered-To: apmail-manifoldcf-dev-archive@manifoldcf.apache.org Received: (qmail 52098 invoked by uid 500); 26 Aug 2015 07:23:46 -0000 Mailing-List: contact dev-help@manifoldcf.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@manifoldcf.apache.org Delivered-To: mailing list dev@manifoldcf.apache.org Received: (qmail 52082 invoked by uid 99); 26 Aug 2015 07:23:45 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 26 Aug 2015 07:23:45 +0000 Date: Wed, 26 Aug 2015 07:23:45 +0000 (UTC) From: "Karl Wright (JIRA)" To: dev@manifoldcf.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (CONNECTORS-1234) TikaExtractor based indexing on Elasticsearch connector MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CONNECTORS-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712642#comment-14712642 ] Karl Wright commented on CONNECTORS-1234: ----------------------------------------- Hi Abe-san, I see no reason to include a binary content length check in this connector, since the Document Filter transformation connector has exactly the same functionality. We need to be careful not to duplicate functionality unnecessarily, or we will have a very messy situation. Or am I missing something? > TikaExtractor based indexing on Elasticsearch connector > ------------------------------------------------------- > > Key: CONNECTORS-1234 > URL: https://issues.apache.org/jira/browse/CONNECTORS-1234 > Project: ManifoldCF > Issue Type: Improvement > Reporter: Shinichiro Abe > Assignee: Shinichiro Abe > Attachments: CONNECTORS-1234.patch > > > We could add the use-mapper-attachments flag. > Default to true, current spec which asks for mapper-attachments plugin on ES side. > If false, it allows us to index the content and metadata that extracted from files through Tika transformer, which means there is no need to install that plugin and put base64 encoded content. -- This message was sent by Atlassian JIRA (v6.3.4#6332)