ctakes-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "britt fitch (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CTAKES-371) update PTB tokenization logic in fast dictionary module
Date Mon, 20 Jul 2015 14:40:05 GMT
britt fitch created CTAKES-371:
----------------------------------

             Summary: update PTB tokenization logic in fast dictionary module
                 Key: CTAKES-371
                 URL: https://issues.apache.org/jira/browse/CTAKES-371
             Project: cTAKES
          Issue Type: Bug
          Components: ctakes-dictionary-lookup
    Affects Versions: 3.2.2
            Reporter: britt fitch
            Assignee: Sean Finan
             Fix For: 3.2.3


PTB tokenization logic is used in places like the tokenizer & dictionary building code.


For example, given “22q11.2 deletion syndrome”:

PTB tokenizer: [22q11, .2, deletion, syndrome]

Dictionary module:  [22q11, ., 2, deletion, syndrome]   
(RareWordTermMapCreator.getTokens)

Dictionary module should be updated to match PTB tokenization logic used elsewhere in ctakes.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message