manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Steph van Schalkwyk (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-1433) Add CLI options to pipeline modules, e.g. allow Tika to export TEXT, not BASE64
Date Thu, 22 Jun 2017 16:06:00 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16059588#comment-16059588
] 

Steph van Schalkwyk commented on CONNECTORS-1433:
-------------------------------------------------

Testing it now. Having some issues with not knowing the field name for the
actual Base64 output from MFC.
In the index it looks like this:
"file": {
"_content_type": "text\/html",
"_name": "Abatasa.html",
"_content": "QWJhdGFzYQpGcm9tIFdpa2lwZWRpYQpBYmF0YXNhIEFmYXI6IGEgYmE----
Have so far tried "file" and "file._content". Neither working.

+1 312 281 8982 (Tel/SMS)

On Thu, Jun 22, 2017 at 10:28 AM, Karl Wright (JIRA) <jira@apache.org>



> Add CLI options to pipeline modules, e.g. allow Tika to export TEXT, not BASE64
> -------------------------------------------------------------------------------
>
>                 Key: CONNECTORS-1433
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1433
>             Project: ManifoldCF
>          Issue Type: Wish
>          Components: Tika extractor
>            Reporter: Steph van Schalkwyk
>            Assignee: Karl Wright
>         Attachments: CONNECTORS-1433.patch
>
>
> Would love to have Tika spout TEXT, not BASE64.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message