ctakes-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "SH.Chou" <cls3...@gmail.com>
Subject Re: Data size of UMLS 2011ab and 2016aa
Date Tue, 04 Oct 2016 19:00:41 GMT
Hi Jessica,
  Thank-You!!! I'm looking into cTAKES sandbox, but I only found this page:
https://cwiki.apache.org/confluence/display/CTAKES/cTAKES+3.2+Dictionaries+and+Models
Do you have more details about those tools? Any directions would be
appreciated!

Thanks,
S.H.

On Tue, Oct 4, 2016 at 2:28 PM, Jessica Glover <glover.jessica.m@gmail.com>
wrote:

> Hi S.H.,
>
> cTAKES does not query the UMLS online version. The cTAKES dictionary is
> built from a subset of the UMLS Metathesaurus. There are two tools in the
> cTAKES sandbox, dictionary-gui and dictionarytool that you can use to build
> a dictionary from the subset of the UMLS Metathesaurus that you installed.
> (they use the META files, though - not the MySQL database)
> The tools allow you to choose what TUIs, vocabularies, etc. to include in
> your dictionary. These choices affect the dictionary's size.
>
> I hope that helps,
> Jessica
>
>
> On Tue, Oct 4, 2016 at 12:36 PM, SH.Chou <cls3415@gmail.com> wrote:
>
>> Hi All,
>>     I just started to use cTAKES, and have a question regarding the data
>> size of UMLS 2011ab (the default dataset in cTAKES) and new 2016aa.
>> I install 2016aa in MySQL database, the data size is about 14G~, but the
>> 2011ab in cTAKES is just 2G~. I wondered if cTAKES use UMLS API and submit
>> words to query UMLS online version?
>> Or cTAKES compressed 2011ab (using HSQL?).
>>
>> Thanks,
>> ​S.H.​
>>
>>
>>
>


-- 
=====================================
Shih-Hsiung Chou
Charlotte, NC.

Mime
View raw message