uima-user mailing list archives

From "Julien Nioche" <lists.digitalpeb...@gmail.com>
Subject Re: running aggregate engine within CPE and client code
Date Thu, 14 Aug 2008 18:49:48 GMT
Hi guys,

As is often the case in similar situations, I suspect that the source of
the problem is in my code. The piece of code that I gave earlier and which
runs the engine is actually embedded in a third-party library. I tried using
it outside that library in a very simple class and the results I get are
very close to those of the CPE, which is reassuring. What confuses me is
that the profiler gives very similar information for both scenarios, so I
still don't know why it is slower when I use it through that library. There
is no visible reason for it.

I know that the library uses the Java Plugin Framework and detects the jars
to use by itself (i.e. they are not explicitly set on the classpath). Maybe
it has something to do with the classloaders? I have no idea, but I have
contacted the author of that third-party library; we'll see if there is a
good explanation. The difference seems to be more pronounced when processing
small documents; with larger docs there is almost no difference.
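
One way to test the classloader hypothesis would be to print where a class was
loaded from and which chain of classloaders is responsible for it, once from
the simple standalone class and once from inside the plugin-based library, and
then compare the two outputs. The following is only a minimal sketch under
that assumption: the class name ClassLoaderReport and its describe method are
hypothetical helpers, not part of UIMA or the Java Plugin Framework, and it
uses only standard JDK calls.

```java
import java.security.CodeSource;

// Hypothetical diagnostic helper (not part of UIMA or JPF).
public class ClassLoaderReport {

    /** Describes where a class was loaded from and its classloader chain. */
    public static String describe(Class<?> cls) {
        StringBuilder sb = new StringBuilder();
        sb.append("class: ").append(cls.getName()).append('\n');

        // The code source is null for classes loaded by the bootstrap loader.
        CodeSource src = cls.getProtectionDomain().getCodeSource();
        sb.append("loaded from: ")
          .append(src == null ? "(bootstrap or unknown)"
                              : String.valueOf(src.getLocation()))
          .append('\n');

        // Walk from the defining loader up through its parents.
        ClassLoader cl = cls.getClassLoader();
        if (cl == null) {
            sb.append("loader: (bootstrap)\n");
        }
        for (; cl != null; cl = cl.getParent()) {
            sb.append("loader: ").append(cl.getClass().getName()).append('\n');
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // In practice one would pass a class from the jar under suspicion.
        System.out.println(describe(ClassLoaderReport.class));
        System.out.println(describe(String.class));

        // The thread context loader often differs inside plugin frameworks.
        ClassLoader ctx = Thread.currentThread().getContextClassLoader();
        System.out.println("context loader: "
                + (ctx == null ? "(none)" : ctx.getClass().getName()));
    }
}
```

If the two environments report different loaders or jar locations for the same
class, that would at least narrow down where to look; identical output would
suggest the cause lies elsewhere.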

Thanks for your help


2008/8/14 Julien Nioche <lists.digitalpebble@gmail.com>

> They use the same uima-core.jar and the same version of the JRE
> (java-1.5.0-sun-, same JVM options, same aggregate engine.
> I see that there is an object called *performanceTuningSettings* but it
> does not seem to be used by the CPE in a special way, and even if it were,
> would that explain such a difference (10 times!)?
> J.
> 2008/8/14 Eddie Epstein <eaepstein@gmail.com>
>> Julien,
>> A UIMA aggregate is a single-threaded animal. Deploying an aggregate
>> under UIMA AS offers the opportunity to deploy the delegates in
>> separate threads.
>> Are the classpaths different between the two scenarios? Different JRE?
>> Eddie
>> On Thu, Aug 14, 2008 at 10:14 AM, Julien Nioche
>> <lists.digitalpebble@gmail.com> wrote:
>> > Hi Eddie,
>> >
>> > Thank you for your message. Yes, the profiling includes everything in my
>> > client code, including the I/O.
>> >
>> > I checked that casPoolSize="1" in my CPM config file. Setting
>> > casPoolSize="3" in the config file makes virtually no difference, which
>> > means that (a) loading my 2000 documents in the same thread or in a
>> > separate one makes no difference or (b) this parameter is not taken into
>> > account at all.
>> >
>> > With an aggregate engine: is each primitive engine executed in a separate
>> > thread or is the whole aggregate done in the same thread?
>> >
>> > Thank you for your help
>> >
>> > Julien
>> >
> --
> DigitalPebble Ltd
> http://www.digitalpebble.com

DigitalPebble Ltd
