Return-Path: Delivered-To: apmail-incubator-uima-user-archive@locus.apache.org Received: (qmail 1893 invoked from network); 3 Jul 2007 12:57:11 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 3 Jul 2007 12:57:11 -0000 Received: (qmail 56058 invoked by uid 500); 3 Jul 2007 12:57:14 -0000 Delivered-To: apmail-incubator-uima-user-archive@incubator.apache.org Received: (qmail 55951 invoked by uid 500); 3 Jul 2007 12:57:13 -0000 Mailing-List: contact uima-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: uima-user@incubator.apache.org Delivered-To: mailing list uima-user@incubator.apache.org Received: (qmail 55942 invoked by uid 99); 3 Jul 2007 12:57:13 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Jul 2007 05:57:13 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=UNPARSEABLE_RELAY X-Spam-Check-By: apache.org Received-SPF: pass (herse.apache.org: local policy) Received: from [192.5.164.99] (HELO casc.gd-ais.com) (192.5.164.99) by apache.org (qpsmtpd/0.29) with SMTP; Tue, 03 Jul 2007 05:57:06 -0700 Received: from ([10.13.13.25]) by camv02-relay2.casc.gd-ais.com with SMTP id 5202701.37195938; Tue, 03 Jul 2007 05:55:57 -0700 Received: from nybf01-mail01.ad.gd-ais.com ([137.100.60.11]) by vaff01-fes01.ad.gd-ais.com with Microsoft SMTPSVC(6.0.3790.1830); Tue, 3 Jul 2007 08:55:57 -0400 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: quoted-printable X-MimeOLE: Produced By Microsoft Exchange V6.5 Subject: RE: UIMA Beginners Help? Date: Tue, 3 Jul 2007 08:55:53 -0400 Message-ID: <31923CD68FF05B42B1D3AC08F2D733B5028012D9@nybf01-mail01.ad.gd-ais.com> In-Reply-To: <4689E013.2050306@coling-uni-jena.de> X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: UIMA Beginners Help? thread-index: Ace9NAy2OD++kZLnQv6u1zgljDJoKAANvNnQ References: <46895649.4040407@serff.net> <4689E013.2050306@coling-uni-jena.de> From: "LeHouillier, Frank D." To: X-OriginalArrivalTime: 03 Jul 2007 12:55:57.0306 (UTC) FILETIME=[7F995DA0:01C7BD71] X-Virus-Checked: Checked by ClamAV on apache.org So this brings up a legitimate question I have. Even if you can specify a pipeline in a CPE, wouldn't it be better practice to use an Aggregate Analysis Engine in cases where there are actual input dependencies between two or more analysis engines. I had always understood the purpose of the CPE to be getting collections of data to and from a set of independent Analysis Engines, rather than specifically for defining dependencies. Is it part of the UIMA spec for example that the CPE define an order on the Analysis Engines? The CPE specifically excludes the inclusion of a Flow Controller description. Another argument I have for using Aggregate AE's in a situation where there are dependencies is that someone might want to use it without using the CPE at all. For example, somebody might already have an application that handles most of the CPE stuff and just want to plug in the UIMA-JNET, they would have to create an Aggregate of the Sentence Annotator and NE extractor anyway, right? On the other hand, is there a way to create an Aggregate Analysis Engine, specifying the primitive Analysis Engines as separate services? If I want the Sentence Annotator to be a service and the NE extractor to be a service, how do I make sure that the CAS hits these in the right order? How do I do this if the flow is not simply linear but dynamic (i.e. the output of the language identifier sends the CAS to the correct Sentence annotator)?=20 -----Original Message----- From: Katrin Tomanek [mailto:tomanek@coling-uni-jena.de]=20 Sent: Tuesday, July 03, 2007 1:35 AM To: uima-user@incubator.apache.org Subject: Re: UIMA Beginners Help? Dear Andrew, I have added a small documentation to our website, explaining how to set up a Collection Processing Engine (CPE) from PEAR packages. See here: https://watchtower.coling.uni-jena.de/~tomanek/UIMA/ There is also a small demo-CPE. In case you want to use the components rather programatically, i.e. in=20 an application, you might refer the "UIMA Tutorial and Developers'=20 Guides" (UIMA Version 2.1), section 3.2 (Using Analysis Engines). Do it=20 as explained there, i.e. make a AE from the sentence splitter and from=20 the named entity tagger. Create a CAS (important: as explained in=20 3.2.6!) and then just run the process method of both AEs, sentence=20 splitter first, then the ne tagger, on the CAS you created. Hope that works. Best wishes, Katrin --=20 Katrin Tomanek Jena University Language and Information Engineering (JULIE) Lab Phone: +49-3641-944307 Fax: +49-3641-944321 email: tomanek@coling-uni-jena.de URL: http://www.julielab.de