Return-Path: X-Original-To: apmail-ctakes-dev-archive@www.apache.org Delivered-To: apmail-ctakes-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6CE0310AEC for ; Wed, 4 Sep 2013 13:17:10 +0000 (UTC) Received: (qmail 6440 invoked by uid 500); 4 Sep 2013 13:17:10 -0000 Delivered-To: apmail-ctakes-dev-archive@ctakes.apache.org Received: (qmail 5736 invoked by uid 500); 4 Sep 2013 13:17:04 -0000 Mailing-List: contact dev-help@ctakes.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ctakes.apache.org Delivered-To: mailing list dev@ctakes.apache.org Received: (qmail 5728 invoked by uid 99); 4 Sep 2013 13:17:02 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 04 Sep 2013 13:17:02 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW X-Spam-Check-By: apache.org Received-SPF: error (nike.apache.org: local policy) Received: from [74.125.82.176] (HELO mail-we0-f176.google.com) (74.125.82.176) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 04 Sep 2013 13:16:56 +0000 Received: by mail-we0-f176.google.com with SMTP id u56so314663wes.35 for ; Wed, 04 Sep 2013 06:16:16 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to:cc:content-type; bh=Hyvn1lmmqJgPKCOTxHXLxtfOihOy8ZMXrbL6Y240h0c=; b=bE2xeBti/0ieXqGcvdXQgRpvj2eLbkAYKPQjSlwru+sYC4L++khgIHm8RRQS0QrQ14 VKmTzd/h7sJAoS7Mp3atBycdIZowkfc6rkygxKtMy8JkO79sYlY5bEJblrcykLoUDYjC n4kBF+AuVCr5i6yHrjew5UfL233LbRmXRp9dMx91EbqQ1I5qgP1uTP/E8TNJ3WUWTbWw RQO/U2/zoniPhOptHLiZvXDz+K5xVeVw1HoIcfPYvVEaY66n4P0O65hNBD2GjhYYugOQ WLBmO4HJYmOYmC9RctEGXnRPTyCKXD9Z3594R/L/MD+Dm7fw9Bc9NQcrWAJvSJvwr7Ze Qnow== X-Gm-Message-State: ALoCoQn+3Pwy8PlLbFvLNXbcbVEp99lOXmyHNRvZ1XjEzhEDXtbWr2sTpSw7KGWuDSgBPR/+dGyt MIME-Version: 1.0 X-Received: by 10.194.63.228 with SMTP id j4mr2427802wjs.34.1378300575991; Wed, 04 Sep 2013 06:16:15 -0700 (PDT) Received: by 10.194.201.227 with HTTP; Wed, 4 Sep 2013 06:16:15 -0700 (PDT) In-Reply-To: <924DE05C19409B438EB81DE683A942D9105A6130@CHEXMBX1A.CHBOSTON.ORG> References: <924DE05C19409B438EB81DE683A942D9105A5977@CHEXMBX1A.CHBOSTON.ORG> <924DE05C19409B438EB81DE683A942D9105A6130@CHEXMBX1A.CHBOSTON.ORG> Date: Wed, 4 Sep 2013 18:46:15 +0530 Message-ID: Subject: Re: Information Regarding Apache cTAKES-3.0 From: Arohi Kumar To: "Chen, Pei" Cc: "dev@ctakes.apache.org" Content-Type: multipart/alternative; boundary=047d7ba97f3a39dc1d04e58e9c90 X-Virus-Checked: Checked by ClamAV on apache.org --047d7ba97f3a39dc1d04e58e9c90 Content-Type: text/plain; charset=ISO-8859-1 Hi Pei, Thank you for pointing me to the resources. ~Arohi On Wed, Sep 4, 2013 at 1:08 AM, Chen, Pei wrote: > [+dev] > Hi Arohi, > I'm glad that you have it working. > To get started, I think a good place to get started would be to take a > look at the current type system[1] which outlines the output[2] that cTAKES > currently supports. > As you already found, the IdentifiedAnnotation (and it's subsclasses such > as xMention, and the UmlsConcept codes) > Unfortunately, there isn't much more documentation than what's in the > current guides[4] at this point in time. However, the mailing lists are a > great place to look for answers you may have. To learn more about the flow > of control of the code, you may want to check out the UIMA [4] framework > which cTAKES is built on top of. > [1] > http://ctakes.apache.org/user-faqs.html#what-are-the-available-attributes-types-in-ctakes > [2] > http://svn.apache.org/repos/asf/ctakes/trunk/ctakes-type-system/src/main/resources/org/apache/ctakes/typesystem/types/TypeSystem.xml > [3] > https://cwiki.apache.org/confluence/display/CTAKES/cTAKES+3.1+Component+Use+Guide > [4] http://uima.apache.org > > I hope that helps. > --Pei > > From: Arohi Kumar [mailto:arohi@mobipulse.in] > Sent: Tuesday, September 03, 2013 2:53 PM > To: Chen, Pei > Subject: Re: Information Regarding Apache cTAKES-3.0 > > Hi Pei, > Thanks for your suggestion. That worked like a charm. I also made it work > using Lucene 3.6 to write a new index which was subsequently readable by > the Lucene 4.0 jars present in the project. Just for curiosity, I have > found(by experimenting with Lucene versions) that the original OrangeBook > index was written to by a Lucene version preceding 1.9. Hope that I am > right? > Now that I am obtaining the output : > I want to be able to understand what I am getting. I have looked at the > output and things like the LookupWindowAnnotation, SignSymptomMention, > Concept, UmlsConcept jump out as being really useful. I want to understand > the other outputs as well as how the code gave them to me. I have looked at > the Component Use Guide, which gives me a overall idea of the cTAKES > pipeline. I am looking for a more detailed explanation. > > I understand that ultimately I will have to get my hands dirty and delve > into the code. Are there any other resources for helping me get started > like an explanation of the output and the flow of control of the code. > Thank you > Arohi Kumar > Ex-CSE, IIT Kharagpur > > On Tue, Sep 3, 2013 at 7:07 PM, Chen, Pei > wrote: > Hi Arohi, > OrangeBook is included in cTAKES' ctakes-dictionary-lookup-res project now: > > http://svn.apache.org/repos/asf/ctakes/trunk/ctakes-dictionary-lookup-res/src/main/resources/org/apache/ctakes/dictionary/lookup/ > Feel free to let us know if that works for you. > --Pei > > From: Arohi Kumar [mailto:arohi@mobipulse.in] > Sent: Tuesday, September 03, 2013 6:29 AM > To: Chen, Pei > Subject: Re: Information Regarding Apache cTAKES-3.0 > > I'm sorry, the link is > > https://sourceforge.net/p/ctakesresources/code/HEAD/tree/trunk/ctakes-resources-dictionary/src/main/resources/org/apache/ctakes/dictionary/lookup/OrangeBook/ > > On Tue, Sep 3, 2013 at 3:58 PM, Arohi Kumar wrote: > Hi Pei, > I am a newbie and learning Apache cTAKES-3.0 for a project. > I was facing an error which was caused when lucene-4.0(included in Apache > cTAKES) tries to read the OrangeBook index. > I went through the mail archives and found that clearing up and replacing > the OrangeBook index with > > > will solve the problem. The above link seems to be broken. I will be > grateful if you could send me an updated link if one exists. > Some alternative ways of solving the problem: > 1. Since the orangebook index has size of only 19,000(approx), I think > that we can also write a new index using lucene-3.0(because 4.0 is able to > read indexes written by 3.0 and later). > 2. Change the lucene-4.0 jars in maven dependency to lucene-3.0 jars, but > that would lead to dependencies being broken and so, I don't want to get > into that. > You suggestions are most welcome. > Thanks > Arohi Kumar > Ex- CSE, IIT Kharagpur > > > > --047d7ba97f3a39dc1d04e58e9c90--