From: ravi garg
To: user@ctakes.apache.org
Date: Tue, 23 Apr 2013 01:06:56 +0530
Subject: Re: Regarding Entity Recognition
In-Reply-To: <924DE05C19409B438EB81DE683A942D91050BE96@CHEXMBX1A.CHBOSTON.ORG>

Hey,

Thanks for the reply.

First, let me brief you on the configuration I am using. I am running AggregatePlaintextProcessor.xml with DictionaryLookupAnnotatorCSV.xml as the DictionaryLookupAnnotator. It reads its dictionary from two sources: the flat file dictionary1.csv and the Lucene index. I have added "knee pain" as a single term in dictionary1.csv (as knee pain| knee pain), but I am still not able to get it as a single entity. Am I missing something here?

Regards,
Ravi Garg

On Tue, Apr 23, 2013 at 12:49 AM, Chen, Pei <Pei.Chen@childrens.harvard.edu> wrote:

> Hi Ravi,
>
> Yes, in your example "knee pain", the default behavior in the dictionary
> lookup will create 3 IdentifiedAnnotations:
> "knee", "pain", as well as "knee pain".
>
> [Assuming the terms exist in the UMLS dictionary]
>
> --Pei
>
> From: ravi garg [mailto:ravigarg27@gmail.com]
> Sent: Monday, April 22, 2013 3:06 PM
> To: user@ctakes.apache.org
> Subject: Regarding Entity Recognition
>
> Hey,
>
> First of all, congrats on building such wonderful software. I am very
> new to cTAKES, so I have a very basic question to ask.
>
> My query is: is it possible to identify multiple words as a single entity?
> For example, right now knee pain gets identified as 'knee' and 'pain', but
> is it possible to get 'knee pain' as a single entity? If so, what changes
> do I have to make to get going?
>
> --
> Ravi Garg
> 3rd Year
> MSc (hons) Biological Sciences
> B.E (hons) Computer Science and Engineering
> BITS Pilani KK Birla Goa Campus

--
Ravi Garg
3rd Year
MSc (hons) Biological Sciences
B.E (hons) Computer Science and Engineering
BITS Pilani KK Birla Goa Campus
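For readers following the thread, the behavior Pei describes (one annotation for every dictionary term found in the text, so "knee", "pain", and "knee pain" all appear) can be illustrated with a toy window-based lookup. This is only a minimal sketch in Python and not cTAKES code: the function names `lookup_spans` and `longest_only`, the `max_len` window size, and the in-memory set standing in for the dictionary are all assumptions made for illustration. The second function shows one possible way to keep only the longest match afterwards. Whether cTAKES offers an equivalent option is not stated in this thread.

```python
from typing import List, Set, Tuple

Match = Tuple[int, int, str]  # (start token index, end token index exclusive, term)


def lookup_spans(tokens: List[str], dictionary: Set[str], max_len: int = 3) -> List[Match]:
    """Return a match for every token window (up to max_len tokens) found in the dictionary."""
    matches: List[Match] = []
    for start in range(len(tokens)):
        last_end = min(start + max_len, len(tokens))
        for end in range(start + 1, last_end + 1):
            candidate = " ".join(tokens[start:end]).lower()
            if candidate in dictionary:
                matches.append((start, end, candidate))
    return matches


def longest_only(matches: List[Match]) -> List[Match]:
    """Drop every match whose span lies inside another, different match."""
    return [
        m for m in matches
        if not any(o != m and o[0] <= m[0] and m[1] <= o[1] for o in matches)
    ]


# Hypothetical input: a toy dictionary holding the three terms from the thread.
tokens = "Patient reports knee pain".split()
dictionary = {"knee", "pain", "knee pain"}

all_matches = lookup_spans(tokens, dictionary)
print(all_matches)                # three matches: "knee", "knee pain", "pain"
print(longest_only(all_matches))  # only the "knee pain" span remains
```

The sketch also shows why the multi-word term has to be present in the dictionary in the first place: if "knee pain" were missing from the set, only the two single-word matches would be produced.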