From: Alberto Garcia
Date: Thu, 19 Feb 2015 11:50:50 +0000
Subject: Concept Mapper Annotator Question
To: user@uima.apache.org
We are starting to use the UIMA framework for entity identification. Our solution is based on dictionaries that contain the entities we need to recognize. We are using the Concept Mapper annotator, and it is very fast at recognizing the complete name of an entity, but it fails to recognize part of an entity name. Let me explain with an example.

Let's say we have this entry in the dictionary:

If we call the service with “New York City” as input text, it recognizes the entity as a Location.

If we call the service with “New City York” or a different permutation, it recognizes the entity as a Location.

BUT if we call the service with “New City”, it does not recognize it as a Location.

Can anyone tell me how I can implement or configure this behavior for the Concept Mapper annotator?
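
To make the difference between the three cases concrete, here is a minimal Python sketch of the two matching rules involved. It is only an illustration of the behavior described above, not Concept Mapper code: the function names, the whitespace tokenizer, and the `min_tokens` threshold are all hypothetical, and the dictionary entry is assumed to be the string "New York City".

```python
def tokenize(text):
    """Lowercase and split on whitespace; a stand-in for a real tokenizer."""
    return text.lower().split()


def full_unordered_match(entry, text):
    """True when the text contains every token of the entry, in any order."""
    return set(tokenize(entry)) <= set(tokenize(text))


def partial_match(entry, text, min_tokens=2):
    """True when at least min_tokens tokens of the entry occur in the text.

    min_tokens is a hypothetical threshold chosen for this sketch.
    """
    shared = set(tokenize(entry)) & set(tokenize(text))
    return len(shared) >= min_tokens


entry = "New York City"  # assumed dictionary entry
for text in ["New York City", "New City York", "New City"]:
    # Print both results side by side for each input text.
    print(text, full_unordered_match(entry, text), partial_match(entry, text))
```

Under the first rule, "New City" is rejected because the token "York" is missing, which corresponds to the third case in the question. The second rule accepts it, at the cost of also accepting any other text that shares two tokens with the entry.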