Return-Path: X-Original-To: apmail-ctakes-dev-archive@www.apache.org Delivered-To: apmail-ctakes-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8428F18F7B for ; Tue, 2 Feb 2016 18:42:23 +0000 (UTC) Received: (qmail 47517 invoked by uid 500); 2 Feb 2016 18:42:10 -0000 Delivered-To: apmail-ctakes-dev-archive@ctakes.apache.org Received: (qmail 47363 invoked by uid 500); 2 Feb 2016 18:42:10 -0000 Mailing-List: contact dev-help@ctakes.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ctakes.apache.org Delivered-To: mailing list dev@ctakes.apache.org Received: (qmail 47230 invoked by uid 99); 2 Feb 2016 18:42:10 -0000 Received: from Unknown (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 02 Feb 2016 18:42:10 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id ED2D6C103C for ; Tue, 2 Feb 2016 18:42:09 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0 X-Spam-Level: X-Spam-Status: No, score=0 tagged_above=-999 required=6.31 tests=[none] autolearn=disabled Received: from mx1-us-west.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id 2jltjboNAPen for ; Tue, 2 Feb 2016 18:42:09 +0000 (UTC) Received: from mailsmtp3.childrenshospital.org (mailsmtp3.childrenshospital.org [134.174.20.73]) by mx1-us-west.apache.org (ASF Mail Server at mx1-us-west.apache.org) with ESMTPS id E24FE20271 for ; Tue, 2 Feb 2016 18:42:08 +0000 (UTC) Received: from pps.filterd (mailsmtp3.childrenshospital.org [127.0.0.1]) by mailsmtp3.childrenshospital.org (8.15.0.59/8.15.0.59) with SMTP id u12If6RI028260 for ; Tue, 2 Feb 2016 13:42:00 -0500 Received: from smtpndc2.chboston.org (smtpndc2.chboston.org [10.20.50.105]) by mailsmtp3.childrenshospital.org with ESMTP id 20te9yca9m-1 (version=TLSv1/SSLv3 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT) for ; Tue, 02 Feb 2016 13:42:00 -0500 Received: from pps.filterd (smtpndc2.chboston.org [127.0.0.1]) by smtpndc2.chboston.org (8.15.0.59/8.15.0.59) with SMTP id u12IfxWY010848 for ; Tue, 2 Feb 2016 13:41:59 -0500 Received: from chexmail1a.chboston.org (internal-ndc-nat-v1260.tch.harvard.edu [10.20.50.4]) by smtpndc2.chboston.org with ESMTP id 20rr0ujb9c-1 for ; Tue, 02 Feb 2016 13:41:59 -0500 Received: from CHEXMAIL1B.CHBOSTON.ORG (10.20.18.127) by CHEXMAIL1A.CHBOSTON.ORG (10.20.50.127) with Microsoft SMTP Server (TLS) id 15.0.1130.7; Tue, 2 Feb 2016 13:41:58 -0500 Received: from CHEXMAIL1B.CHBOSTON.ORG ([fe80::94ec:d731:f3a5:1e4d]) by CHEXMAIL1B.CHBOSTON.ORG ([fe80::94ec:d731:f3a5:1e4d%21]) with mapi id 15.00.1130.005; Tue, 2 Feb 2016 13:41:58 -0500 From: "Finan, Sean" To: "dev@ctakes.apache.org" Subject: RE: Distributing modifiers across conjunctions Thread-Topic: Distributing modifiers across conjunctions Thread-Index: AQHRXd9y/HwJahEWqE24WBSVlkT75p8ZFbRw Date: Tue, 2 Feb 2016 18:41:57 +0000 Message-ID: References: <03D94C27-5466-4962-96D6-84FA0E929B4E@ucsf.edu> In-Reply-To: <03D94C27-5466-4962-96D6-84FA0E929B4E@ucsf.edu> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-ms-exchange-transport-fromentityheader: Hosted x-originating-ip: [10.7.2.26] Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:,, definitions=2016-02-02_11:,, signatures=0 X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:,, definitions=2016-02-02_11:,, signatures=0 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 spamscore=0 suspectscore=0 malwarescore=0 phishscore=0 adultscore=0 bulkscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1601100000 definitions=main-1602020300 Hi John, One thing that you can try is using the "Overlap" version of the dictionary= lookup. It should identify both "right .... lower quadrant pain" and "lef= t lower quadrant pain" as Sign/Symptoms. Check https://cwiki.apache.org/confluence/display/CTAKES/cTAKES+3.2+-+Fast+= Dictionary+Lookup almost at the bottom: "Text Overlap Match" Sean -----Original Message----- From: John Mongan [mailto:john.mongan@ucsf.edu]=20 Sent: Tuesday, February 02, 2016 12:30 PM To: dev@ctakes.apache.org Subject: Distributing modifiers across conjunctions I'm working with some text that has a lot of phrases of the form: right and left lower quadrant pain In processing this text, I would like to identify the concepts "right lower= quadrant pain" and "left lower quadrant pain", but instead the concepts th= at get identified are "right" and "left lower quadrant pain" I've got some ideas about how I could fix this, but wanted to see if this i= s a problem that others have worked on so I don't unnecessarily reinvent an= y wheels. Thanks, John