datafu-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Matthew Hayes (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DATAFU-88) Port Stanford Core NLP Functionality to DataFu
Date Sat, 17 Mar 2018 21:20:00 GMT

    [ https://issues.apache.org/jira/browse/DATAFU-88?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16403736#comment-16403736
] 

Matthew Hayes commented on DATAFU-88:
-------------------------------------

I propose we close this to avoid any GPL entanglements and complications with trying to work
around them.  Any objections?

> Port Stanford Core NLP Functionality to DataFu
> ----------------------------------------------
>
>                 Key: DATAFU-88
>                 URL: https://issues.apache.org/jira/browse/DATAFU-88
>             Project: DataFu
>          Issue Type: New Feature
>    Affects Versions: 1.3.0
>            Reporter: Russell Jurney
>            Assignee: Russell Jurney
>            Priority: Major
>              Labels: lemmatizer, nlp, pig, pig_udf, stanford, stemmer
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> For starters I need the Stanford Core NLP stemmer and lemmatizer. 
> It looks like maybe I can add something generic and feed arguments to code like: props.put("annotators",
"tokenize, ssplit, pos, lemma");
> Helpful example of lemmatizing at http://stackoverflow.com/questions/1578062/lemmatization-java



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message