Return-Path: X-Original-To: apmail-ctakes-dev-archive@www.apache.org Delivered-To: apmail-ctakes-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DEB9B11A97 for ; Sun, 11 May 2014 13:48:21 +0000 (UTC) Received: (qmail 82403 invoked by uid 500); 11 May 2014 12:48:21 -0000 Delivered-To: apmail-ctakes-dev-archive@ctakes.apache.org Received: (qmail 82346 invoked by uid 500); 11 May 2014 12:48:21 -0000 Mailing-List: contact dev-help@ctakes.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ctakes.apache.org Delivered-To: mailing list dev@ctakes.apache.org Received: (qmail 82209 invoked by uid 99); 11 May 2014 12:48:21 -0000 Received: from Unknown (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 11 May 2014 12:48:21 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of chakraborti.anirban@googlemail.com designates 209.85.216.51 as permitted sender) Received: from [209.85.216.51] (HELO mail-qa0-f51.google.com) (209.85.216.51) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 11 May 2014 12:48:17 +0000 Received: by mail-qa0-f51.google.com with SMTP id w8so5890128qac.24 for ; Sun, 11 May 2014 05:47:54 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=N1LoN3jOeNmWcuSkmqKvqQF2FGZcECMpUnrw6fKllKA=; b=Nq4ZXTstbpRHUStARzKNXSic9894Jjhc06IPC2y2qqF0ICLTG5vq+/08xJ0yfHDRxK LeesvX9HF0Hg90CxJZ53PTPe/q/g5EvDtTzlULGDoyl4HRQQ2Qnjmc2gCJ99xm6AEhwC vr4Z6ZkazyTKbRrFqgNj9yRB7F/BOr+bx7lN1/qDTW9Fjm1RPOH4BJQ41tuLycTKFKtq QXf7HRwqB7j/DJRJ2jILKWrA/uh2JHSbtZw5LvMLjs2dBer+72Fxq+5LSzU4U/1CETdM nXCa8c4nYwj/pXwEOT0mHEuXeDEVqdUj05uG71heayD5tsRgo5cZi5fJ7VeLfAytptns gQpQ== MIME-Version: 1.0 X-Received: by 10.140.47.167 with SMTP id m36mr28492261qga.21.1399812473843; Sun, 11 May 2014 05:47:53 -0700 (PDT) Received: by 10.140.91.82 with HTTP; Sun, 11 May 2014 05:47:53 -0700 (PDT) In-Reply-To: References: Date: Sun, 11 May 2014 18:17:53 +0530 Message-ID: Subject: Re: markable types From: Anirban Chakraborti To: dev@ctakes.apache.org Content-Type: multipart/alternative; boundary=001a11c1641e4173ec04f91f3d03 X-Virus-Checked: Checked by ClamAV on apache.org --001a11c1641e4173ec04f91f3d03 Content-Type: text/plain; charset=UTF-8 Steven, Would you have any example code of tree parser so the output can be arranged as per need. I mean, after successful annotation, I want to extract certain concepts like medication only and arrange them in a new tree so that all annotation in reference to medication concept and their sources are listed together. Anir On Sun, May 11, 2014 at 3:55 PM, Steven Bethard wrote: > I don't think "not something anyone would want extracted" should be an > argument against anything. We already have constituent and dependency > parse trees in the type system, and those would fall under that > category. > > So +1 on markables in the type system. (In general, +1 on moving > module-specific types to the standard type system. I'm not sure what > the real benefit of splitting them out is...) > > Steve > > On Fri, May 9, 2014 at 11:53 AM, Miller, Timothy > wrote: > > What do people think about taking the "markable" types out of the > > coreference project and adding them to the standard type system? This is > > a pretty standard concept in coreference that doesn't really have a > > great natural representation in the current type system -- it > > encompasses IdentifiedAnnotations as well as pronouns ("It", "him", > > "her") and some determiners ("this"). > > > > The drawback I can see is that it is probably not something anyone would > > want extracted -- ultimately you want the actual coref pairs or chains. > > But it is useful for things like representing gold standard input or > > splitting coreference resolution into separate markable recognition and > > relation classification steps. > > > > Tim > > > --001a11c1641e4173ec04f91f3d03--