ctakes-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tim Miller (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CTAKES-41) cTAKES forces end of sentence at end of line
Date Thu, 23 May 2013 17:59:21 GMT

    [ https://issues.apache.org/jira/browse/CTAKES-41?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13665424#comment-13665424

Tim Miller commented on CTAKES-41:

Opennlp 1.5.3 sentence detector now allows for learning when newlines are end of sentence.
I am in the process of putting together a small sample data set to see if this can be learned
reliably. In the meantime I experimented with just removing the code that splits up newline-forced
sentences, and the results are not very good.
> cTAKES forces end of sentence at end of line
> --------------------------------------------
>                 Key: CTAKES-41
>                 URL: https://issues.apache.org/jira/browse/CTAKES-41
>             Project: cTAKES
>          Issue Type: Improvement
>            Reporter: Pei Chen
>            Priority: Minor
> Not everyone wants the current behavior, which limits sentences to single lines.
> Note that POS Tagger may need to be taken into consideration since NewlineToken inherits
from BaseToken and we don't want newlines to affect the POS assignments. 

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message