ctakes-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tim Miller (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CTAKES-41) cTAKES forces end of sentence at end of line
Date Thu, 23 May 2013 17:59:21 GMT

    [ https://issues.apache.org/jira/browse/CTAKES-41?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13665424#comment-13665424
] 

Tim Miller commented on CTAKES-41:
----------------------------------

Opennlp 1.5.3 sentence detector now allows for learning when newlines are end of sentence.
I am in the process of putting together a small sample data set to see if this can be learned
reliably. In the meantime I experimented with just removing the code that splits up newline-forced
sentences, and the results are not very good.
                
> cTAKES forces end of sentence at end of line
> --------------------------------------------
>
>                 Key: CTAKES-41
>                 URL: https://issues.apache.org/jira/browse/CTAKES-41
>             Project: cTAKES
>          Issue Type: Improvement
>            Reporter: Pei Chen
>            Priority: Minor
>
> Not everyone wants the current behavior, which limits sentences to single lines.
> Note that POS Tagger may need to be taken into consideration since NewlineToken inherits
from BaseToken and we don't want newlines to affect the POS assignments. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message