ctakes-dev mailing list archives

From "John Green" <john.travis.gr...@gmail.com>
Subject Re: Acronym annotator
Date Fri, 22 Aug 2014 14:25:31 GMT
Thanks for the tip! I'm so new to this field. By the by, Vijay, your semantic similarity paper
was outstanding; I enjoyed reading it very much.

Well, it doesn't have to be the best, just OK. It's mostly because I don't see one in the project
now, and it's a project for me to solidify the process of making my own first annotator. I would
like it to be useful though, so I'll look into the literature.


On Fri, Aug 22, 2014 at 7:12 AM, vijay garla <vngarla@gmail.com> wrote:

> This is a type of word sense disambiguation; there is a lot of literature
> on this subject.  Co-occurrence is one way of doing it, not necessarily the
> best; you need a ton of annotated data for it to work well.
> On Thu, Aug 21, 2014 at 9:08 PM, John Green <john.travis.green@gmail.com>
> wrote:
>> Are there any acronym annotators and disambiguators? What are people doing
>> in production elsewhere? I'm learning the heart of cTAKES and UIMA by the
>> numbers right now, and I think writing an annotator of my own will be the
>> best way to solidify the information. If no one has done it already, I
>> thought I'd write a simple acronym annotator and disambiguator. The
>> disambiguation would just be a co-occurrence count over a lookup window
>> across a private corpus I have access to, e.g., word1 word2 word3 acronym1
>> word4 word5 word6. I would provide specificity by excluding words that
>> tend to occur frequently across all senses of the same abbreviation.
>> But if someone has already done it and is planning on releasing it, I'd
>> hate to reinvent the wheel...
>> JG
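
The window-based co-occurrence idea described in the quoted message could be sketched roughly as follows. This is a minimal illustration in plain Python, not a cTAKES or UIMA annotator, and nothing here comes from the thread itself: the function names (context_window, train, disambiguate), the window size of 3, and the assumption that sense-labeled examples are available for one abbreviation are all hypothetical.

```python
from collections import Counter, defaultdict


def context_window(tokens, index, size=3):
    """Return the lowercased tokens within `size` positions of tokens[index],
    excluding the acronym token itself."""
    left = tokens[max(0, index - size):index]
    right = tokens[index + 1:index + 1 + size]
    return [t.lower() for t in left + right]


def train(examples, size=3):
    """Build per-sense context counts for a single abbreviation.

    `examples` is an iterable of (tokens, acronym_index, sense) triples.
    The sense labels are an assumption of this sketch: some annotated
    data is needed to know which expansion each occurrence refers to.
    """
    counts = defaultdict(Counter)
    for tokens, index, sense in examples:
        counts[sense].update(context_window(tokens, index, size))

    # "Specificity" step: drop any word seen in the context of more than
    # one sense, since it cannot help tell the senses apart.
    senses_per_word = Counter()
    for ctr in counts.values():
        senses_per_word.update(ctr.keys())
    shared = {w for w, n in senses_per_word.items() if n > 1}

    return {
        sense: Counter({w: c for w, c in ctr.items() if w not in shared})
        for sense, ctr in counts.items()
    }


def disambiguate(model, tokens, index, size=3):
    """Return the sense whose training contexts overlap most with the
    window around tokens[index], or None if there is no overlap."""
    window = context_window(tokens, index, size)
    scores = {sense: sum(ctr[w] for w in window) for sense, ctr in model.items()}
    if not scores:
        return None
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


# Toy training data for the abbreviation "ra" (made up for illustration).
examples = [
    ("the patient has ra with joint pain".split(), 3, "rheumatoid arthritis"),
    ("the ra pressure was elevated on cath".split(), 1, "right atrium"),
]
model = train(examples)
```

With the toy data above, disambiguate(model, "joint swelling and ra flare".split(), 3) picks "rheumatoid arthritis" because "joint" appears only in that sense's contexts, while "the" is discarded as shared between both senses. As the reply notes, raw counts like these need a lot of annotated data before they work well.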