Date: Sat, 16 Aug 2014 09:07:23 -0700 (PDT)
MIME-Version: 1.0
Message-Id: <1408205243431.cc29641b@Nodemailer>
From: "John Green"
To: dev@ctakes.apache.org
Subject: Re: Change from SNOMEDCT to SNOMEDCT_US affecting v_snomed_fword_lookup
Content-Type: multipart/alternative; boundary="----Nodemailer-0.5.0-?=_1-1408205243640"

------Nodemailer-0.5.0-?=_1-1408205243640
Content-Type: text/plain; charset=utf-8

Great catch!

On Sat, Aug 16, 2014 at 2:53 AM, Tim O'Connell wrote:
> Hi folks,
> I was having an issue with the current build (from svn) of ctakes/ytex not
> identifying any annotations, as have some folks on this board. I traced it to
> the fact that the UMLS database has, at some time in the relatively recent
> past, changed the SAB tag in the MRCONSO table for SNOMED terms from
> SNOMEDCT to SNOMEDCT_US. I just had a newer version of UMLS that uses
> SNOMEDCT_US.
> Thus, when the install script tried to create the
> v_snomed_fword_lookup table, it wasn't finding any of the SNOMEDCT terms,
> so nothing was getting annotated.
> The ytex install script was only looking for things in MRCONSO with the
> SNOMEDCT SAB tag when it created the ytex lookup table. After changing
> this to SNOMEDCT_US in the file
> CTAKES_HOME/bin/ctakes-ytex/scripts/data/mysql/umls/insert_view_template.sql
> it now works (for mysql users) and finds the annotations. You can just re-run
> the ytex setup script, but that takes hours. Instead, I deleted all
> the data from the v_snomed_fword_lookup table and ran the sql
> command to repopulate the table, and it worked fine. Here's the code. N.b.
> my schema name for my umls database is 'umls'; change the code below if
> yours is different.
> delete from v_snomed_fword_lookup;
> insert into v_snomed_fword_lookup (cui, tui, fword, fstem, tok_str,
> stem_str)
> select mrc.cui, t.tui, c.fword, c.fstem, c.tok_str, c.stem_str
> from umls_aui_fword c
> inner join umls.MRCONSO mrc on c.aui = mrc.aui and mrc.SAB in (
>   'SNOMEDCT_US', 'RXNORM')
> inner join
> (
>   select cui, min(tui) tui
>   from umls.MRSTY sty
>   where sty.tui in
>   (
>     'T019', 'T020', 'T037', 'T046', 'T047', 'T048', 'T049', 'T050',
>     'T190', 'T191', 'T033',
>     'T184',
>     'T017', 'T029', 'T023', 'T030', 'T031', 'T022', 'T025', 'T026',
>     'T018', 'T021', 'T024',
>     'T116', 'T195', 'T123', 'T122', 'T118', 'T103', 'T120', 'T104',
>     'T200', 'T111', 'T196', 'T126', 'T131', 'T125', 'T129', 'T130',
>     'T197', 'T119', 'T124', 'T114', 'T109', 'T115', 'T121', 'T192',
>     'T110', 'T127',
>     'T060', 'T065', 'T058', 'T059', 'T063', 'T062', 'T061',
>     'T074', 'T075',
>     'T059'
>   )
>   group by cui
> ) t on t.cui = mrc.cui
> ;
> Hope it helps - cheers,
> Tim

------Nodemailer-0.5.0-?=_1-1408205243640--