Return-Path: Delivered-To: apmail-incubator-uima-user-archive@locus.apache.org Received: (qmail 93874 invoked from network); 26 Nov 2008 13:28:36 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 26 Nov 2008 13:28:36 -0000 Received: (qmail 45525 invoked by uid 500); 26 Nov 2008 13:28:46 -0000 Delivered-To: apmail-incubator-uima-user-archive@incubator.apache.org Received: (qmail 45509 invoked by uid 500); 26 Nov 2008 13:28:46 -0000 Mailing-List: contact uima-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: uima-user@incubator.apache.org Delivered-To: mailing list uima-user@incubator.apache.org Received: (qmail 45498 invoked by uid 99); 26 Nov 2008 13:28:46 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 26 Nov 2008 05:28:46 -0800 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of twgoetz@gmx.de designates 213.165.64.20 as permitted sender) Received: from [213.165.64.20] (HELO mail.gmx.net) (213.165.64.20) by apache.org (qpsmtpd/0.29) with SMTP; Wed, 26 Nov 2008 13:27:19 +0000 Received: (qmail invoked by alias); 26 Nov 2008 13:28:04 -0000 Received: from blueice2n1.de.ibm.com (EHLO [9.152.14.84]) [195.212.29.171] by mail.gmx.net (mp051) with SMTP; 26 Nov 2008 14:28:04 +0100 X-Authenticated: #25330878 X-Provags-ID: V01U2FsdGVkX19Qu8RqE19QL7gte+QZjZPSBv2VhhGRmKtj2PXHVS 0rnuwE8c0RtNWh Message-ID: <492D4EBB.1070804@gmx.de> Date: Wed, 26 Nov 2008 14:27:23 +0100 From: Thilo Goetz User-Agent: Thunderbird 2.0.0.18 (Windows/20081105) MIME-Version: 1.0 To: uima-user@incubator.apache.org Subject: Re: Additional Runtime Arguments References: <2A5EB5171962E44F84756823FD32249502C66E60@exchange.interactivesys.com> In-Reply-To: <2A5EB5171962E44F84756823FD32249502C66E60@exchange.interactivesys.com> X-Enigmail-Version: 0.95.6 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Y-GMX-Trusted: 0 X-FuHaFi: 0.68 X-Virus-Checked: Checked by ClamAV on apache.org Thomas Polzin wrote: > I probably missed something in the documentation but here is my problem: > > > I'm trying to process a pretty complex structured document and I do not > want to translate the entire structure into annotations. Instead I would > like to pass some object containing this additional structure to the > respective analysis engines. Note that a static object will not work. > Also, I do not want pass this object to the engines when they are > constructed/initialized. The best solution I came up with so far is to > subclass the CAS object and then to downcast to it in the process( CAS > cas) function. Does this make sense? Is there a better solution? > > Thanks so much in advance > > Thomas > > UIMA doesn't really support this. The reason is that the UIMA philosophy is that all data must live in the CAS. Otherwise your analysis is not portable, remotable etc. Howsoever that may be, there are various ways you can get around this restriction, if you really need to. For example, if your documents have a unique ID, you can write that into the CAS somehow. Then you could have a static factory where your annotators can go and get the object for the document ID they're working on. Somehow, you also need to clear this cache after processing has finished. You should *not* subclass the CAS, for any number of reasons. HTH, Thilo