Return-Path: Delivered-To: apmail-incubator-uima-user-archive@locus.apache.org Received: (qmail 92348 invoked from network); 4 Nov 2008 16:38:37 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 4 Nov 2008 16:38:37 -0000 Received: (qmail 82154 invoked by uid 500); 4 Nov 2008 16:38:43 -0000 Delivered-To: apmail-incubator-uima-user-archive@incubator.apache.org Received: (qmail 82136 invoked by uid 500); 4 Nov 2008 16:38:43 -0000 Mailing-List: contact uima-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: uima-user@incubator.apache.org Delivered-To: mailing list uima-user@incubator.apache.org Received: (qmail 82120 invoked by uid 99); 4 Nov 2008 16:38:43 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 Nov 2008 08:38:43 -0800 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of msa@schor.com designates 67.18.22.69 as permitted sender) Received: from [67.18.22.69] (HELO gateway10.websitewelcome.com) (67.18.22.69) by apache.org (qpsmtpd/0.29) with SMTP; Tue, 04 Nov 2008 16:37:25 +0000 Received: (qmail 3502 invoked from network); 4 Nov 2008 16:50:20 -0000 Received: from gator74.hostgator.com (67.18.27.130) by gateway10.websitewelcome.com with SMTP; 4 Nov 2008 16:50:20 -0000 Received: from yktgi01e0-s5.watson.ibm.com ([129.34.20.19]:1820 helo=[9.2.34.80]) by gator74.hostgator.com with esmtpsa (TLSv1:AES256-SHA:256) (Exim 4.68) (envelope-from ) id 1KxOuh-0000vq-Pr for uima-user@incubator.apache.org; Tue, 04 Nov 2008 10:37:56 -0600 Message-ID: <49107A60.6030700@schor.com> Date: Tue, 04 Nov 2008 11:37:52 -0500 From: Marshall Schor User-Agent: Thunderbird 2.0.0.17 (Windows/20080914) MIME-Version: 1.0 To: uima-user@incubator.apache.org Subject: Re: Question on Capabilities of AE descriptor References: <490EC6BF.3040704@barcelonamedia.org> In-Reply-To: <490EC6BF.3040704@barcelonamedia.org> X-Enigmail-Version: 0.95.7 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - gator74.hostgator.com X-AntiAbuse: Original Domain - incubator.apache.org X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - schor.com X-Virus-Checked: Checked by ClamAV on apache.org Bart Mellebeek wrote: > > Hello, > > > > I have a question on the exact role of the output types in the > > Capabilities of an AE descriptor that I couldn't find in the > > documentation. > > A strange thing happens when I try to manipulate the descriptors of > > ex4/ of the tutorial in uimaj-examples. I am running > > ex4/MeetingDetectorTAE.xml with UIMA Document Analyzer. When I delete > > the output type RoomNumber in the Capabilities of > > ex2/RoomNumberAnnotator.xml and I run ex4/MeetingDetectorTAE.xml, the > > RoomNumber type is still visible in the analysis results. > I think this is because ex4/MeetingDetectorTAE.xml itself declares it outputs the RoomNumber type. The DocumentAnalyzer is just a sample application that shows *selected* feature structure types - selected by looking at the output capabilities of the top-most analysis engine (in the case of an aggregate having "nested" components - such as you have in your example). This means that the DocumentAnalyzer may not be showing all the feature structures in the CAS, but that doesn't mean that those feature structures are not there. See the code in uimaj-tools project: in src/main/org/apache/uima/tools/docanalyzer/DocumentAnalyzer.java, lines 1185 - 1207. > > Likewise, when I delete the output types TimeAnnot and DateAnnot in > > the capabilities of ex3/TutorialDateTime.xml, these types are still > > visible in the analysis results. > I think for the same reason - the ex4/MeetingDetectorTAE.xml itself declares it outputs the the DateAnnot and TimeAnnot feature structures. > > Only deleting the output type DateTimeAnnot in the capabilities of > > ex3/TutorialDateTime.xml seems to have an impact on the analysis results. > I ran the DocAnalyzer without modifying the examples, and the DateTimeAnnot does *not* appear - this is the expected behavior because it is not listed in the DocumentAnalyzer's output capabilities. I think it will not appear, even if you don't delete the output type DateTimeAnnot in the capabilities of ex3/TutorialDateTime.xml. > > > > Why is it that deleting some output types have no impact on analysis > > results, while deleting other output types do have an impact? Aren't > > all output types supposed to have this impact? > The UIMA framework makes the UIMA Metadata available to applications, but doesn't specify what those application do with that data. The DocumentAnalyzer is just a sample application - built to show many of the capabilities of UIMA. It took a particular design choice - to show annotations in the CAS that were specified as output capabilities of the top-most component (in the case of aggregates). Hope that helps. -Marshall > > Any help appreciated. > Thanks, > > Bart > >