manifoldcf-dev mailing list archives

From "Karl Wright (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (CONNECTORS-1433) Add CLI options to pipeline modules, e.g. allow Tika to export TEXT, not BASE64
Date Thu, 22 Jun 2017 21:57:00 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16060055#comment-16060055
] 

Karl Wright edited comment on CONNECTORS-1433 at 6/22/17 9:56 PM:
------------------------------------------------------------------

OK, it just started to work. I should have looked at the code earlier.
Sorry for all the unnecessary work!
This is _not_ using the mapping plugin, and it does specify the content field.
I'll update the docs with an example if you want.

{code}
"decoded": "\/*\n** MediaWiki 'monobook' style sheet for CSS2-capable
browsers.\n** Copyright Gabriel Wicke - http:\/\/wikidev.net\/\n** License:
GPL (http:\/\/www.gnu.org\/copyleft\/gpl.html)\n**\n** Loosely based on
http:\/\/www.positioniseverything.net\/ordered-floats.html by Big John\n**
and the Plone 2.0 styles, see http:\/\/plone.org\/ (Alexander Limi,Joe
Geldart & Tom Croucher,\n** Michael Zeltner and Geir Bækholt)\n** All you
guys rock :)\n*\/\n\n#column-content {\n\twidth: 100%;\n\tfloat:
right;\n\tmargin: 0 0 .6em -12.2em;\n\tpadding: 0;\n}\n#content
{\n\tmargin: 2.8em 0 0 12.2em;\n\tpadding: 0 1em 1.5em 1em;\n\tposition:
relative;\n\tz-
{code}
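
For readers comparing the two forms, here is a minimal Python sketch (illustration only, not ManifoldCF or Tika code) of the difference between Base64 content and the plain text shown in the "decoded" field above. It assumes the \/ and \n sequences in that output are ordinary JSON string escapes; the sample text is a shortened, hypothetical stand-in for the style sheet, and the variable names are made up for this example.

```python
import base64
import json

# Hypothetical sample: a short fragment in the style of the output above.
css_text = "/*\n** MediaWiki 'monobook' style sheet\n*/\n#column-content {\n\twidth: 100%;\n}\n"

# What a Base64-exporting pipeline would hand to the output side.
encoded = base64.b64encode(css_text.encode("utf-8")).decode("ascii")

# Decoding the Base64 payload recovers the original text.
decoded = base64.b64decode(encoded).decode("utf-8")

# Some JSON serializers write "/" as "\/". Python's json module does not,
# so that escaping is emulated here to mirror the output quoted above.
doc = json.dumps({"decoded": decoded}).replace("/", "\\/")

# Parsing the JSON turns \/ back into / and \n back into real newlines.
roundtrip = json.loads(doc)["decoded"]

print(encoded)
print(doc)
print(roundtrip)
```

The escaped form in the JSON document and the plain text are the same content: a JSON parser on the receiving side restores the slashes, newlines and tabs.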

+1 312 281 8982 (Tel/SMS)

On Thu, Jun 22, 2017 at 4:38 PM, Steph van Schalkwyk <




was (Author: svanschalkwyk):
OK, it just started to work. I should have looked at the code earlier.
Sorry for all the unnecessary work!
This is _not_ using the mapping plugin, and it does specify the content field.
I'll update the docs with an example if you want.

"decoded": "\/*\n** MediaWiki 'monobook' style sheet for CSS2-capable
browsers.\n** Copyright Gabriel Wicke - http:\/\/wikidev.net\/\n** License:
GPL (http:\/\/www.gnu.org\/copyleft\/gpl.html)\n**\n** Loosely based on
http:\/\/www.positioniseverything.net\/ordered-floats.html by Big John\n**
and the Plone 2.0 styles, see http:\/\/plone.org\/ (Alexander Limi,Joe
Geldart & Tom Croucher,\n** Michael Zeltner and Geir Bækholt)\n** All you
guys rock :)\n*\/\n\n#column-content {\n\twidth: 100%;\n\tfloat:
right;\n\tmargin: 0 0 .6em -12.2em;\n\tpadding: 0;\n}\n#content
{\n\tmargin: 2.8em 0 0 12.2em;\n\tpadding: 0 1em 1.5em 1em;\n\tposition:
relative;\n\tz-

+1 312 281 8982 (Tel/SMS)

On Thu, Jun 22, 2017 at 4:38 PM, Steph van Schalkwyk <



> Add CLI options to pipeline modules, e.g. allow Tika to export TEXT, not BASE64
> -------------------------------------------------------------------------------
>
>                 Key: CONNECTORS-1433
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1433
>             Project: ManifoldCF
>          Issue Type: Wish
>          Components: Tika extractor
>            Reporter: Steph van Schalkwyk
>            Assignee: Karl Wright
>         Attachments: CONNECTORS-1433.patch, image.png, image.png
>
>
> Would love to have Tika spout TEXT, not BASE64.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
