ctakes-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "James Joseph Masanz (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CTAKES-214) too few sentences
Date Wed, 03 Jul 2013 18:50:22 GMT
James Joseph Masanz created CTAKES-214:

             Summary: too few sentences 
                 Key: CTAKES-214
                 URL: https://issues.apache.org/jira/browse/CTAKES-214
             Project: cTAKES
          Issue Type: Bug
          Components: ctakes-core
    Affects Versions: 3.1
            Reporter: James Joseph Masanz
             Fix For: 3.1

Some sentence breaks found by cTAKES 3.0 are no longer being found by code at head of trunk.

Discovered by running the regression test (junit within ctakes-regression-test).

Also reproduced using the CVD GUI with launch config "UIMA_CVD--clinical_documents_pipeline"
and the following text (taken from ctakes-regression-test\testdata\input\plaintext\doc1_07543210_sample_current.txt)

"Miss. CM is a energetic young woman who has had bouts with sleeplessness for the past year
or so.  She said that her insomnia began with the death of her father who was killed in a
train accident last year.
Patient is 25 and claims she has smoked for the last five years or so. She used to smoke about
half a pack a day, but for the last month she has been down to about 3-5 cigarettes a day.
She is having trouble stopping altogether."

Only 2 sentences are being found now. Previously, 5 were found.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message