ctakes-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From John Green <john.travis.gr...@gmail.com>
Subject Acronym annotator
Date Thu, 21 Aug 2014 19:08:50 GMT
Are there any acronym annotators and disambiguators? What are people doing
in production elsewhere? Im learning the heart of cTakes and UIMA by the
numbers right now and I think writing an annotator of my own will be the
best way to solidify the information. If no one has it done already, I
thought Id write a simple acronym annotator and disambiguator. The
disambiguation would just be a co-occurance over a lookup window across a
private corpus I have access to, e.g., word1 word 2 word3 acronym1 word4
word5 word6. I would provide specificity by excluding words that tend to
occur frequently across instances of the acronyms with the same

But, if someone has already done it and is planning on releasing it, I hate
to reproduce wheels...


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message