manifoldcf-dev mailing list archives

From "Rafa Haro (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-1181) Apache Stanbol Transformation Connector
Date Thu, 11 Feb 2016 08:54:18 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15142437#comment-15142437 ]

Rafa Haro commented on CONNECTORS-1181:
---------------------------------------

I have created the branch and imported the connector using the pull request patch: https://svn.apache.org/repos/asf/manifoldcf/branches/CONNECTORS-1181.
I haven't had time to review the code yet, but it will probably need changes similar to
those made to the OpenNLP connector: chunking the input stream for enhancement.
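
As a rough illustration of what chunking the input for enhancement could look like, here is a minimal sketch. It is not taken from the branch or from the OpenNLP connector: `TextChunker`, `chunk` and `maxChars` are hypothetical names, and the whitespace-based cut is just one possible policy. The idea is only that each piece sent to the Enhancer stays under a fixed size.

```java
import java.io.IOException;
import java.io.Reader;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration; not part of ManifoldCF or Stanbol.
final class TextChunker {

    private TextChunker() {}

    /**
     * Reads the whole reader and splits its text into chunks of at most
     * maxChars characters, cutting just after the last whitespace found
     * inside each window when there is one.
     */
    static List<String> chunk(Reader in, int maxChars) throws IOException {
        if (maxChars <= 0) {
            throw new IllegalArgumentException("maxChars must be positive");
        }
        List<String> chunks = new ArrayList<>();
        StringBuilder pending = new StringBuilder();
        char[] buf = new char[4096];
        int n;
        while ((n = in.read(buf)) != -1) {
            pending.append(buf, 0, n);
            // Emit chunks while more than one window of text is buffered.
            while (pending.length() > maxChars) {
                int cut = cutPoint(pending, maxChars);
                chunks.add(pending.substring(0, cut));
                pending.delete(0, cut);
            }
        }
        // Whatever remains fits in a single chunk.
        if (pending.length() > 0) {
            chunks.add(pending.toString());
        }
        return chunks;
    }

    // Index just past the last whitespace within the first `limit` chars,
    // or `limit` itself (a hard cut) when the window has no usable whitespace.
    private static int cutPoint(CharSequence s, int limit) {
        for (int i = limit - 1; i > 0; i--) {
            if (Character.isWhitespace(s.charAt(i))) {
                return i + 1;
            }
        }
        return limit;
    }
}
```

Concatenating the returned chunks gives back the original text, so nothing is dropped. A real connector would more likely hand each chunk to the enhancer as it is produced rather than collect them in a list, and a hard cut may need extra care around surrogate pairs.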

> Apache Stanbol Transformation Connector
> ---------------------------------------
>
>                 Key: CONNECTORS-1181
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1181
>             Project: ManifoldCF
>          Issue Type: Wish
>    Affects Versions: ManifoldCF 1.8.2, ManifoldCF 2.0.2
>            Reporter: Rafa Haro
>            Assignee: Rafa Haro
>            Priority: Minor
>              Labels: connect, transformation
>             Fix For: ManifoldCF 2.4
>
>
> Apache Stanbol (https://stanbol.apache.org/) provides a set of reusable components for
> semantic content management. One of these components is the Enhancer (https://stanbol.apache.org/docs/trunk/components/enhancer/),
> which extracts features and semantic metadata from textual content, such as entities/concepts
> from domain ontologies, named entities, and so on.
> Apache Stanbol provides an easy-to-use REST API. The main idea behind this transformation
> connector is to enrich the Repository Document's (string) content with a configured
> Stanbol processing chain. The Transformation Connector would allow the user to configure which
> metadata is extracted from the Enhancer result and included as the RD's metadata.
> This behavior is intended to replace, to some extent, the functionality of the old Apache Stanbol CMS
> Adapter (https://stanbol.apache.org/docs/trunk/components/cmsadapter/) and ContentHub (https://stanbol.apache.org/docs/trunk/components/contenthub/).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
