manifoldcf-dev mailing list archives

From "Karl Wright (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-1270) Import OpenNLP connector into trunk
Date Tue, 26 Jan 2016 20:55:39 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15117992#comment-15117992 ]

Karl Wright commented on CONNECTORS-1270:
-----------------------------------------

The actual models are not scary-big (about 16 MB in total):

{code}
01/26/2016  03:46 PM         5,110,658 en-ner-location.bin
01/26/2016  03:47 PM         5,297,172 en-ner-organization.bin
01/26/2016  03:46 PM         5,207,953 en-ner-person.bin
01/26/2016  03:47 PM            98,533 en-sent.bin
01/26/2016  03:46 PM           439,890 en-token.bin
{code}

So, on size alone, I think either including them as a resource or downloading them on the fly would work.
Unfortunately, while I can find numerous models freely downloadable from opennlp.sourceforge.net/models-1.5,
it's not clear what their license is.  OpenNLP clearly moved at some point from SourceForge
to Apache, but it is not clear whether the available models came along.  Without a known license
we can't ship the models ourselves, so downloading on the fly is the only real option.
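
To make "downloading on the fly" concrete, here is a minimal sketch of fetching a model into a local cache directory the first time it is needed. Everything in it is an assumption for illustration: the class name ModelFetcher, the method ensureModel, and the idea of a cache directory are hypothetical and are not part of the contributed connector or of OpenNLP.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URI;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

// Hypothetical helper, not part of ManifoldCF or OpenNLP.
final class ModelFetcher {

    /**
     * Returns the local path of the named model file, downloading it into
     * cacheDir only when no cached copy exists yet.
     *
     * baseUri must end with '/', otherwise URI.resolve replaces its last
     * path segment instead of appending the model name.
     */
    static Path ensureModel(URI baseUri, String modelName, Path cacheDir) throws IOException {
        Files.createDirectories(cacheDir);
        Path target = cacheDir.resolve(modelName);
        if (Files.isRegularFile(target)) {
            return target; // already cached, no network access needed
        }
        // Download to a temporary file first so that an interrupted transfer
        // never leaves a truncated model under the final name.
        Path partial = Files.createTempFile(cacheDir, modelName, ".part");
        try {
            try (InputStream in = baseUri.resolve(modelName).toURL().openStream()) {
                Files.copy(in, partial, StandardCopyOption.REPLACE_EXISTING);
            }
            Files.move(partial, target, StandardCopyOption.REPLACE_EXISTING);
        } finally {
            Files.deleteIfExists(partial); // no-op after a successful move
        }
        return target;
    }
}
```

A caller would pass the download location as the base URI, for example URI.create("http://opennlp.sourceforge.net/models-1.5/") with "en-sent.bin" as the model name, assuming the files are served directly under that path. Nothing is redistributed this way; each installation fetches its own copy.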




> Import OpenNLP connector into trunk
> -----------------------------------
>
>                 Key: CONNECTORS-1270
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1270
>             Project: ManifoldCF
>          Issue Type: Task
>            Reporter: Karl Wright
>            Assignee: Rafa Haro
>             Fix For: ManifoldCF 2.4
>
>
> An OpenNLP connector has been contributed on GitHub.  Need to import it into MCF, first to a branch, then to trunk.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
