ctakes-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "James Joseph Masanz (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CTAKES-231) missing NEs because of inconsistent chunking for parallel sentence constructions
Date Thu, 29 Aug 2013 15:35:52 GMT
James Joseph Masanz created CTAKES-231:

             Summary: missing NEs because of inconsistent chunking for parallel sentence constructions
                 Key: CTAKES-231
                 URL: https://issues.apache.org/jira/browse/CTAKES-231
             Project: cTAKES
          Issue Type: Bug
          Components: ctakes-chunker
    Affects Versions: 3.0-incubating
            Reporter: James Joseph Masanz
         Attachments: liver.cancer.chunking.issue.xmi.xml

cancer of colon, lung and liver
results in an annotation for liver cancer

cancer of colon, liver and lung.
does not result in an annotation for liver cancer or for lung cancer.

Thanks Dennis Lee Hon Kit for reporting this.


Reproduced by running 3.0.0-incubating with the separately downloadable UMLS resources, using
the AggregatePlaintextUMLSProcessor.xml, results in these chunk annotations:

 [0] org.apache.ctakes.typesystem.type.syntax.NP
 [1] org.apache.ctakes.typesystem.type.syntax.PP
 [2] org.apache.ctakes.typesystem.type.syntax.NP
 [3] org.apache.ctakes.typesystem.type.syntax.NP
 [4] org.apache.ctakes.typesystem.type.syntax.PP
 [5] org.apache.ctakes.typesystem.type.syntax.NP
 [6] org.apache.ctakes.typesystem.type.syntax.O
 [7] org.apache.ctakes.typesystem.type.syntax.O
 [8] org.apache.ctakes.typesystem.type.syntax.NP

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message