Return-Path: X-Original-To: apmail-ctakes-user-archive@www.apache.org Delivered-To: apmail-ctakes-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A4C421788B for ; Tue, 7 Oct 2014 18:40:39 +0000 (UTC) Received: (qmail 16930 invoked by uid 500); 7 Oct 2014 18:40:39 -0000 Delivered-To: apmail-ctakes-user-archive@ctakes.apache.org Received: (qmail 16894 invoked by uid 500); 7 Oct 2014 18:40:39 -0000 Mailing-List: contact user-help@ctakes.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@ctakes.apache.org Delivered-To: mailing list user@ctakes.apache.org Received: (qmail 16884 invoked by uid 99); 7 Oct 2014 18:40:39 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 07 Oct 2014 18:40:39 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [209.85.220.41] (HELO mail-pa0-f41.google.com) (209.85.220.41) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 07 Oct 2014 18:40:34 +0000 Received: by mail-pa0-f41.google.com with SMTP id eu11so7632355pac.14 for ; Tue, 07 Oct 2014 11:40:12 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:message-id:date:from:user-agent:mime-version:to :subject:references:in-reply-to:content-type; bh=ueh9sTob1k81Zny9pFAx4P2Wq9/vg5aC0dzf33Oi02Y=; b=ASNoARwyx5mzoKVbo0Wvr3XL6vWOGDfnba07aIAb+MWiKwTeEF/OljR+7bDiGEv73f V9/eP3ztCh9jdN7dKOf3xjPXl5P/CMaJmVxt5F9t4d1xgWUY3/4z2sS0Gp+c6ne4kD8E r7y/8qMI8PSmqF8AmGpuka1MVmj9JSsaGMTrtFYDPFmH5doE/dWl5BcMi73hDO/HPvvl xDcjwXUc/ShOLhcsbBvlBE2BY3qp63U+qOfoWwpfpkKV9bMx98tHP+OLPyynSlQPX3Vg 9p/sdAlha51Gbyl5KGYG6x7wsLKec9RK+OGEcKYXJbk5UO1TlBLYNYlXnBgrJN2oFydY vl/Q== X-Gm-Message-State: ALoCoQnCCmo6o6Tfsmu9LNVA8Tej12oNBphhcbpk8+IhyclelGSlxF8IVzgiKUzUkxQFXNWHrnDf X-Received: by 10.68.209.194 with SMTP id mo2mr5370193pbc.80.1412707212779; Tue, 07 Oct 2014 11:40:12 -0700 (PDT) Received: from localhost.localdomain (184-155-223-24.cpe.cableone.net. [184.155.223.24]) by mx.google.com with ESMTPSA id tv4sm16922015pab.28.2014.10.07.11.40.11 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Tue, 07 Oct 2014 11:40:12 -0700 (PDT) Message-ID: <5434338A.5090101@perfectsearchcorp.com> Date: Tue, 07 Oct 2014 12:40:10 -0600 From: Kim Ebert User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:24.0) Gecko/20100101 Thunderbird/24.7.0 MIME-Version: 1.0 To: user@ctakes.apache.org, natalia.v.connolly@gmail.com Subject: Re: Negative polarity - why? References: In-Reply-To: Content-Type: multipart/alternative; boundary="------------070003020502020202090309" X-Virus-Checked: Checked by ClamAV on apache.org This is a multi-part message in MIME format. --------------070003020502020202090309 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Hi Natalia, Here are a few variations on your sentence with the results. The following results in a polarity of 1 for all of the DiseaseDisorderMentions. "Unspecified pervasive developmental disorder, active state" "Unspecified pervasive developmental disorder,*current or *active state" results with one negative polarity. "*- *Unspecified pervasive developmental disorder, current or active state*- *" results in three negative polarities. "*- *Unspecified pervasive developmental disorder, current or active state" results are the same. Oddly, "Unspecified pervasive developmental disorder, current or active state -" results in one negative polarity. I hope this helps. Kim Ebert 1.801.669.7342 Perfect Search Corp http://www.perfectsearchcorp.com/ On 09/30/2014 09:14 AM, Natalia Connolly wrote: > Dear cTAKES Experts, > > I have a piece of free text that includes a diagnosis in a > stand-alone sentence, like this: > > " - Unspecified pervasive developmental disorder, current or active > state - " > > For some reason cTAKES seems to think the polarity of this > statement is negative: > > _id="18158" _ref_sofa="3" begin="936" end="980" conceptType="PROBLEM" > conceptText="Unspecified pervasive developmental disorder" > externalId="0" originalEntityExternalId="8563"/> > > _indexed="1" _id="8563" _ref_sofa="3" begin="936" end="980" id="40" > _ref_ontologyConceptArr="8556" typeID="2" segmentID="SIMPLE_SEGMENT" > discoveryTechnique="1" confidence="1.0" *polarity="-1"* > uncertainty="0" conditional="false" generic="false" subject="patient" > historyOf="0"/> > > Why is that?? Can it be the hyphens? > > Thanks for any insight, > > Natalia Connolly > --------------070003020502020202090309 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: 8bit
Hi Natalia,

Here are a few variations on your sentence with the results.

The following results in a polarity of 1 for all of the DiseaseDisorderMentions. "Unspecified pervasive developmental disorder, active state"

    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="180" _ref_sofa="1" begin="0" end="44" id="0" _ref_ontologyConceptArr="173" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="1" uncertainty="0" conditional="false" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="233" _ref_sofa="1" begin="22" end="44" id="2" _ref_ontologyConceptArr="229" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="1" uncertainty="1" conditional="false" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="276" _ref_sofa="1" begin="36" end="44" id="3" _ref_ontologyConceptArr="273" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="1" uncertainty="1" conditional="false" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="359" _ref_sofa="1" begin="12" end="44" id="1" _ref_ontologyConceptArr="352" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="1" uncertainty="0" conditional="false" generic="false" subject="patient" historyOf="0"/>

"Unspecified pervasive developmental disorder, current or active state" results with one negative polarity.

    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="204" _ref_sofa="1" begin="0" end="44" id="0" _ref_ontologyConceptArr="197" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="1" uncertainty="0" conditional="false" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="257" _ref_sofa="1" begin="22" end="44" id="2" _ref_ontologyConceptArr="253" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="1" uncertainty="1" conditional="false" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="300" _ref_sofa="1" begin="36" end="44" id="3" _ref_ontologyConceptArr="297" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="1" uncertainty="1" conditional="false" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="383" _ref_sofa="1" begin="12" end="44" id="1" _ref_ontologyConceptArr="376" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="-1" uncertainty="0" conditional="false" generic="false" subject="patient" historyOf="0"/>


" - Unspecified pervasive developmental disorder, current or active state - " results in three negative polarities.

    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="180" _ref_sofa="1" begin="39" end="47" id="3" _ref_ontologyConceptArr="177" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="1" uncertainty="0" conditional="true" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="263" _ref_sofa="1" begin="3" end="47" id="0" _ref_ontologyConceptArr="256" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="-1" uncertainty="0" conditional="false" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="346" _ref_sofa="1" begin="15" end="47" id="1" _ref_ontologyConceptArr="339" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="-1" uncertainty="0" conditional="false" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="399" _ref_sofa="1" begin="25" end="47" id="2" _ref_ontologyConceptArr="395" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="-1" uncertainty="0" conditional="false" generic="false" subject="patient" historyOf="0"/>

" - Unspecified pervasive developmental disorder, current or active state" results are the same.
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="172" _ref_sofa="1" begin="39" end="47" id="3" _ref_ontologyConceptArr="169" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="1" uncertainty="1" conditional="false" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="255" _ref_sofa="1" begin="3" end="47" id="0" _ref_ontologyConceptArr="248" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="-1" uncertainty="0" conditional="false" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="338" _ref_sofa="1" begin="15" end="47" id="1" _ref_ontologyConceptArr="331" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="-1" uncertainty="0" conditional="false" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="391" _ref_sofa="1" begin="25" end="47" id="2" _ref_ontologyConceptArr="387" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="-1" uncertainty="0" conditional="false" generic="false" subject="patient" historyOf="0"/>


Oddly, "Unspecified pervasive developmental disorder, current or active state -" results in one negative polarity.

    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="212" _ref_sofa="1" begin="0" end="44" id="0" _ref_ontologyConceptArr="205" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="-1" uncertainty="0" conditional="false" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="265" _ref_sofa="1" begin="22" end="44" id="2" _ref_ontologyConceptArr="261" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="1" uncertainty="0" conditional="true" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="308" _ref_sofa="1" begin="36" end="44" id="3" _ref_ontologyConceptArr="305" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="1" uncertainty="0" conditional="true" generic="false" subject="patient" historyOf="0"/>
    <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="391" _ref_sofa="1" begin="12" end="44" id="1" _ref_ontologyConceptArr="384" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="1" uncertainty="0" conditional="true" generic="false" subject="patient" historyOf="0"/>

I hope this helps.


Kim Ebert
1.801.669.7342
Perfect Search Corp
http://www.perfectsearchcorp.com/
On 09/30/2014 09:14 AM, Natalia Connolly wrote:
Dear cTAKES Experts,
 
    I have a piece of free text that includes a diagnosis in a stand-alone sentence, like this:

" - Unspecified pervasive developmental disorder, current or active state - "

     For some reason cTAKES seems to think the polarity of this statement is negative:

  <org.apache.ctakes.assertion.medfacts.types.Concept _indexed="1" _id="18158" _ref_sofa="3" begin="936" end="980" conceptType="PROBLEM" conceptText="Unspecified pervasive developmental disorder" externalId="0" originalEntityExternalId="8563"/>

 <org.apache.ctakes.typesystem.type.textsem.DiseaseDisorderMention _indexed="1" _id="8563" _ref_sofa="3" begin="936" end="980" id="40" _ref_ontologyConceptArr="8556" typeID="2" segmentID="SIMPLE_SEGMENT" discoveryTechnique="1" confidence="1.0" polarity="-1" uncertainty="0" conditional="false" generic="false" subject="patient" historyOf="0"/>
   
     Why is that??  Can it be the hyphens?

     Thanks for any insight,

     Natalia Connolly


--------------070003020502020202090309--