uima-user mailing list archives

From Marshall Schor <...@schor.com>
Subject Re: Restricting an aggregate engine to a substring or mention
Date Tue, 17 Jun 2014 12:39:42 GMT
One other thought (probably not well-formed...):

You could use 1 CAS, but multiple views.

Each view can have its own subject-of-analysis.  This might not work for you,
though, as you might want the original subject-of-analysis in order to preserve
the "offset" values for annotations' begin and end features.
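
To illustrate the offset concern: if a view's subject-of-analysis is only a
substring of the original text, annotations created there carry offsets
relative to that substring and have to be shifted to line up with the
original. A minimal sketch of that arithmetic in plain Java follows. It does
not use the UIMA API; SegmentOffsets, Span and toDocumentOffsets are
hypothetical names chosen for illustration.

```java
import java.util.ArrayList;
import java.util.List;

public class SegmentOffsets {

    /** A character span, standing in for an annotation's begin and end features. */
    public static final class Span {
        public final int begin;
        public final int end;

        public Span(int begin, int end) {
            this.begin = begin;
            this.end = end;
        }
    }

    /**
     * Shifts spans expressed relative to a segment's text so that they are
     * expressed relative to the original document text instead.
     *
     * @param segmentSpans spans with offsets into the segment text
     * @param segmentBegin offset of the segment's first character in the original text
     */
    public static List<Span> toDocumentOffsets(List<Span> segmentSpans, int segmentBegin) {
        List<Span> result = new ArrayList<>();
        for (Span s : segmentSpans) {
            result.add(new Span(s.begin + segmentBegin, s.end + segmentBegin));
        }
        return result;
    }

    public static void main(String[] args) {
        String document = "Header. Body text here. Footer.";
        int segmentBegin = 8;
        int segmentEnd = 23;
        // The segment plays the role of a second view's subject-of-analysis.
        String segment = document.substring(segmentBegin, segmentEnd);

        // Pretend an analysis step found "text" inside the segment.
        int begin = segment.indexOf("text");
        List<Span> local = new ArrayList<>();
        local.add(new Span(begin, begin + "text".length()));

        // Map the result back and read it out of the original document.
        for (Span s : toDocumentOffsets(local, segmentBegin)) {
            System.out.println(document.substring(s.begin, s.end));
        }
    }
}
```

The shift is only this simple when the segment is one contiguous substring;
a subject-of-analysis assembled from several pieces would need a per-piece
mapping.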

Each view can have its own set of indexes.  This would enable you to index just
the annotations that were "in scope".  But this might not work for you if the
Feature Structures themselves had references beyond the "scope" - they would be
valid and you might want them to be invalid or null or ?? 
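
The "index just what is in scope" idea amounts to a filter over the existing
annotations. The sketch below shows that selection in plain Java rather than
with the UIMA API; ScopeFilter, Span and inScope are hypothetical names, and
"in scope" is assumed to mean "lies entirely inside the scope span".

```java
import java.util.ArrayList;
import java.util.List;

public class ScopeFilter {

    /** A character span, standing in for an annotation's begin and end features. */
    public static final class Span {
        public final int begin;
        public final int end;

        public Span(int begin, int end) {
            this.begin = begin;
            this.end = end;
        }
    }

    /**
     * Returns the spans that lie entirely inside [scopeBegin, scopeEnd).
     * These are the ones that would be indexed in the restricted view;
     * the spans themselves are not copied or changed.
     */
    public static List<Span> inScope(List<Span> all, int scopeBegin, int scopeEnd) {
        List<Span> kept = new ArrayList<>();
        for (Span s : all) {
            if (s.begin >= scopeBegin && s.end <= scopeEnd) {
                kept.add(s);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<Span> all = new ArrayList<>();
        all.add(new Span(0, 7));    // before the scope
        all.add(new Span(13, 17));  // inside the scope
        all.add(new Span(20, 30));  // straddles the end of the scope

        for (Span s : inScope(all, 8, 23)) {
            System.out.println(s.begin + "-" + s.end);
        }
    }
}
```

A filter like this only decides what gets indexed. Any references the kept
objects hold to things outside the scope are left as they are, which is the
caveat mentioned above.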

With multiple views, it is possible to have cross-view references as values in
the CAS, in case that was of interest.

Perhaps this might be something to consider.


On 6/16/2014 12:23 PM, Richard Eckart de Castilho wrote:
> The CasMultiplier is not a scoping operator per se.
> I understood that you want to scope your AEs to specific sections of a CAS.
> Since there is no generic scoping operator in UIMA (that I am aware of),
> the next best thing one can do (I think) is to slice the CAS into multiple CASes
> that each represent the scope you want to work on and merge them in the end.
> Cheers,
> -- Richard
> On 16.06.2014, at 11:47, Dr. Armin Wegner <arminwegner@googlemail.com> wrote:
>> Hello Richard!
>> As far as I know, CasMultipliers split the CAS into two or more new
>> CASes that are processed independently and must be merged into a
>> final CAS again. That's not what I want to do. I have only one CAS and
>> want to add annotations to this CAS. Can this be achieved with
>> CasMultipliers?
>> Cheers,
>> Armin
>> On 6/12/14, Richard Eckart de Castilho <rec@apache.org> wrote:
>>> Hi Armin,
>>> the only generic approach that I am aware of would be a CasMultiplier.
>>> Different component collections may offer alternative solutions
>>> in general or in specific components.
>>> I believe Ruta has the concept of limiting rules to certain context
>>> annotation types, but I do not know if that also works when external
>>> AEs are invoked.
>>> Cheers,
>>> -- Richard
>>> On 12.06.2014, at 12:00, Dr. Armin Wegner <arminwegner@googlemail.com>
>>> wrote:
>>>> Hello!
>>>> Is there a UIMA component which restricts an aggregated analysis
>>>> engine to a substring of the document text or to mentions of a given
>>>> annotation type? That is, is there a UIMA equivalent to GATE's Segment
>>>> Processing PR?
>>>> Thanks,
>>>> Armin
