ctakes-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF subversion and git services (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CTAKES-214) too few sentences
Date Mon, 08 Jul 2013 21:05:49 GMT

    [ https://issues.apache.org/jira/browse/CTAKES-214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13702423#comment-13702423

ASF subversion and git services commented on CTAKES-214:

Commit 1500949 from [~tmill]
[ https://svn.apache.org/r1500949 ]

Fixes CTAKES-214: opennlp-1.5 is using new constants to represent sentence/non-sentence classifications.
Updated our constants to reflect new versions. Should try to directly use their constants
in case it changes in the future.
> too few sentences 
> ------------------
>                 Key: CTAKES-214
>                 URL: https://issues.apache.org/jira/browse/CTAKES-214
>             Project: cTAKES
>          Issue Type: Bug
>          Components: ctakes-core
>    Affects Versions: 3.1
>            Reporter: James Joseph Masanz
>             Fix For: 3.1
> Some sentence breaks found by cTAKES 3.0 are no longer being found by code at head of
> Discovered by running the regression test (junit within ctakes-regression-test).
> Also reproduced using the CVD GUI with launch config "UIMA_CVD--clinical_documents_pipeline"
and the following text (taken from ctakes-regression-test\testdata\input\plaintext\doc1_07543210_sample_current.txt)
> "Miss. CM is a energetic young woman who has had bouts with sleeplessness for the past
year or so.  She said that her insomnia began with the death of her father who was killed
in a train accident last year.
> Patient is 25 and claims she has smoked for the last five years or so. She used to smoke
about half a pack a day, but for the last month she has been down to about 3-5 cigarettes
a day. She is having trouble stopping altogether."
> Only 2 sentences are being found now. Previously, 5 were found.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message