From: "Geise, Brandon D."
To: "dev@ctakes.apache.org"
Subject: RE: Fast Dictionary Update
Date: Wed, 16 Sep 2015 18:42:19 +0000
Message-ID: <33605790b50c4a6e819d3eb44a39c722@LOFEXMBX207W12V.geisinger.edu>
In-Reply-To: <7c53c1cc96844ee3804e2943dc01dd84@CHEXMAIL1B.CHBOSTON.ORG>

Thanks Sean, much appreciated. To clarify, the example below would create the dictionary for use with the rare word approach?

Thanks,
Brandon

-----Original Message-----
From: Finan, Sean [mailto:Sean.Finan@childrens.harvard.edu]
Sent: Wednesday, September 16, 2015 2:16 PM
To: dev@ctakes.apache.org
Subject: RE: Fast Dictionary Update

Hi Brandon,

I just checked in a bin/dictionarytool.zip
It should have everything that you need (.jar, lib/, data/).

java -cp dictionarytool.jar;lib/* org.apache.ctakes.dictionarytool.DictionaryCreator2 [args]

Should do the trick.

To recreate a 2015 version of the current ctakes dictionary, the arguments are:

-umls my/path/to/2015AA/META -fd ./data/tiny -atui ./data/tiny/CtakesAnatTuis.txt -tui ./data/tiny/CtakesSnomedTuis.txt -db jdbc:hsqldb:file:my/path/to/snorx2015 -tbl CUI_TERMS

Create my/path/to/snorx2015 by copying resources/memdbtemplate/ctakesumls.properties to my/path/to/snorx2015.properties - there is a resources/README about this.

Before populating a DB, I usually do a trial run first, writing to a flat file. Replace "-db ... -tbl ..." with "-ol my/path/to/testout.bsv"

Sean

-----Original Message-----
From: Geise, Brandon D. [mailto:bdgeise@geisinger.edu]
Sent: Wednesday, September 16, 2015 1:49 PM
To: dev@ctakes.apache.org
Subject: RE: Fast Dictionary Update

Hi Sean,

That'd be great.

I think I'm building it incorrectly because after I build the jar and try to run specifying DictionaryCreator2 as the main class it says it can't find it. I'm not too familiar with Java and building projects/jars so it could be my ignorance causing the problem.

Thanks,
Brandon

-----Original Message-----
From: Finan, Sean [mailto:Sean.Finan@childrens.harvard.edu]
Sent: Wednesday, September 16, 2015 1:45 PM
To: dev@ctakes.apache.org
Subject: RE: Fast Dictionary Update

Hi Brandon,

I can send you a jar or commit one pre-built. What goes wrong when you try to build the tool?

Sean

-----Original Message-----
From: Geise, Brandon D. [mailto:bdgeise@geisinger.edu]
Sent: Wednesday, September 16, 2015 1:23 PM
To: 'dev@ctakes.apache.org'
Subject: Fast Dictionary Update

Does someone have the DictionaryTool jar available? I'm having trouble creating the jar file from the project and would like to be able to create an updated UMLS fast dictionary for 2015.

Thanks,
Brandon
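
For reference, the two invocations Sean describes (a trial run to a flat file, then the run that populates the database) could be assembled roughly as follows. This is only a dry-run sketch that prints the command lines rather than executing them. The variable names, the build_cmd helper, and the choice of ':' as classpath separator are assumptions for illustration; every path is a placeholder copied from the message, and the message itself uses ';', the Windows separator.

```shell
#!/bin/sh
# Dry-run sketch: prints the DictionaryCreator2 command lines instead of running them.
# All paths are the placeholders from the message; substitute real locations.

UMLS_META="my/path/to/2015AA/META"
DB_PATH="my/path/to/snorx2015"
# Assumption: ':' is the Java classpath separator on Linux/macOS.
# Use ';' on Windows, as in the original message.
CP_SEP=":"

# build_cmd OUTPUT_ARGS (hypothetical helper): print the full command line
# with the given output arguments appended.
build_cmd() {
  printf 'java -cp "dictionarytool.jar%slib/*" %s -umls %s -fd ./data/tiny -atui ./data/tiny/CtakesAnatTuis.txt -tui ./data/tiny/CtakesSnomedTuis.txt %s\n' \
    "$CP_SEP" "org.apache.ctakes.dictionarytool.DictionaryCreator2" "$UMLS_META" "$1"
}

# Step 1: trial run, writing to a flat file.
build_cmd "-ol my/path/to/testout.bsv"

# Step 2: populate the HSQLDB table. Per the message, my/path/to/snorx2015.properties
# is first created by copying resources/memdbtemplate/ctakesumls.properties.
build_cmd "-db jdbc:hsqldb:file:$DB_PATH -tbl CUI_TERMS"
```

The classpath is printed inside double quotes so that, when the line is pasted into a shell, lib/* reaches the java launcher unexpanded and the launcher resolves the wildcard itself.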