manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Steph van Schalkwyk (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-1523) HTML Extractor transformation connector - "No englobing tag specified"
Date Thu, 09 Aug 2018 16:24:00 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16575089#comment-16575089
] 

Steph van Schalkwyk commented on CONNECTORS-1523:
-------------------------------------------------

Thank you Olivier. 

Do you know if it is available in 2.10?

Also, does it only filter element_type#id as in div#my_id or does it also filter element_type#css_class
? 

I'm crawling pages where the html has very few ids.

Regards,

Steph

 

> HTML Extractor transformation connector - "No englobing tag specified"
> ----------------------------------------------------------------------
>
>                 Key: CONNECTORS-1523
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1523
>             Project: ManifoldCF
>          Issue Type: Bug
>    Affects Versions: ManifoldCF 2.10
>            Reporter: Steph van Schalkwyk
>            Priority: Major
>
> When adding Englobing tag to HTML Extractor transformation, Englobing tag is not persisted. 
> Can add on config screen in job edit, but value is not persisted.
> Results in "No englobing tag specified".



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message