Return-Path: X-Original-To: apmail-manifoldcf-commits-archive@www.apache.org Delivered-To: apmail-manifoldcf-commits-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5737411086 for ; Thu, 24 Jul 2014 12:34:09 +0000 (UTC) Received: (qmail 58803 invoked by uid 500); 24 Jul 2014 12:34:09 -0000 Delivered-To: apmail-manifoldcf-commits-archive@manifoldcf.apache.org Received: (qmail 58749 invoked by uid 500); 24 Jul 2014 12:34:09 -0000 Mailing-List: contact commits-help@manifoldcf.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@manifoldcf.apache.org Delivered-To: mailing list commits@manifoldcf.apache.org Received: (qmail 58740 invoked by uid 99); 24 Jul 2014 12:34:09 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 Jul 2014 12:34:09 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO eris.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 Jul 2014 12:34:10 +0000 Received: from eris.apache.org (localhost [127.0.0.1]) by eris.apache.org (Postfix) with ESMTP id 8332923889E1; Thu, 24 Jul 2014 12:33:45 +0000 (UTC) Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Subject: svn commit: r1613096 - in /manifoldcf/trunk/site/src/documentation: content/xdocs/en_US/end-user-documentation.xml resources/images/en_US/tika-job-exceptions.PNG Date: Thu, 24 Jul 2014 12:33:45 -0000 To: commits@manifoldcf.apache.org From: kwright@apache.org X-Mailer: svnmailer-1.0.9 Message-Id: <20140724123345.8332923889E1@eris.apache.org> X-Virus-Checked: Checked by ClamAV on apache.org Author: kwright Date: Thu Jul 24 12:33:45 2014 New Revision: 1613096 URL: http://svn.apache.org/r1613096 Log: Update to include new tika tab Added: manifoldcf/trunk/site/src/documentation/resources/images/en_US/tika-job-exceptions.PNG (with props) Modified: manifoldcf/trunk/site/src/documentation/content/xdocs/en_US/end-user-documentation.xml Modified: manifoldcf/trunk/site/src/documentation/content/xdocs/en_US/end-user-documentation.xml URL: http://svn.apache.org/viewvc/manifoldcf/trunk/site/src/documentation/content/xdocs/en_US/end-user-documentation.xml?rev=1613096&r1=1613095&r2=1613096&view=diff ============================================================================== --- manifoldcf/trunk/site/src/documentation/content/xdocs/en_US/end-user-documentation.xml (original) +++ manifoldcf/trunk/site/src/documentation/content/xdocs/en_US/end-user-documentation.xml Thu Jul 24 12:33:45 2014 @@ -984,13 +984,18 @@ curl -XGET http://localhost:9200/index/_

As with all document transformers, more than one Tika Content Extractor transformation filter can be used in a single pipeline. In the case of the Tika Content Extractor, this does not seem to be of much utility.

The Tika Content Extractor transformation connection type does not require anything other than standard configuration information.

-

The Tika Content Extractor transformation connection type contributes a single tab to a job definition. This the "Field mapping" tab, which - looks like this:

+

The Tika Content Extractor transformation connection type contributes two tabs to a job definition. These are the "Field mapping" tab, and the "Exceptions" tab. + The "Field mapping" tab looks like this:





Enter a Tika-generated metadata field name, and a final field name, and click the "Add" button to add the mapping to the list. Uncheck the "Keep all metadata" checkbox if you want unspecified Tika metadata to be excluded from the final document.

+

The "Exceptions" tab looks like this:

+

+
+

+

Uncheck the checkbox to allow indexing of document metadata even when Tika fails to extract content from the document.

Added: manifoldcf/trunk/site/src/documentation/resources/images/en_US/tika-job-exceptions.PNG URL: http://svn.apache.org/viewvc/manifoldcf/trunk/site/src/documentation/resources/images/en_US/tika-job-exceptions.PNG?rev=1613096&view=auto ============================================================================== Binary file - no diff available. Propchange: manifoldcf/trunk/site/src/documentation/resources/images/en_US/tika-job-exceptions.PNG ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream