Date: Sun, 16 Feb 2014 10:26:18 -0500
Subject: Re: Sectionizer
From: John Green
To: "dev@ctakes.apache.org"

To answer my own question, and for anyone searching the mail archives in the
future, I haven't fully explored it yet, but it seems that Vanderbilt already
did this with SecTag.

JG

On Sun, Feb 9, 2014 at 9:29 PM, digital paula wrote:

> John,
>
> I've been out of the loop for a few weeks now w/the cTAKES developer list
> and will be until end of Feb... too many pressing deadlines. I have to
> step through the code again to verify if sectionizer uses reg ex but I'm
> pretty sure that it does. Hmmm, that does sound interesting to train the
> sectionizer from clinical notes. Not familiar with Mastif so hopefully
> someone else can chime in on your question there. Talk to you and
> everyone very soon. :-)
>
> Regards,
> Paula
>
> > Date: Sun, 9 Feb 2014 17:05:40 -0500
> > Subject: Sectionizer
> > From: john.travis.green@gmail.com
> > To: dev@ctakes.apache.org
> >
> > I know there has been some chatter about sectionizers. From what I
> > understand Paula is doing, and from what I understand YTEX does, they are
> > all regular expression based, correct?
> >
> > Has anyone added to rule based matching a statistical/Markov type
> > sectionizer? E.g. one trained from a bunch of notes?
> >
> > Does Mastif work this way?
> >
> > Thanks all,
> > JG