uima-user mailing list archives

From Eddie Epstein <eaepst...@gmail.com>
Subject Re: Ruta parallel execution
Date Fri, 19 Dec 2014 16:33:19 GMT
Hi Silvestre,

An aggregate deployed with UIMA-AS can run delegate annotators in parallel,
with a few restrictions:
 - the aggregate must be deployed with async=true
 - the parallel delegates must each run in a remote process
 - the delegates must not modify preexisting feature structures (FS)
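
To illustrate the last restriction, here is a plain-Java analogy, not UIMA-AS
code. It assumes that results from parallel delegates can be merged safely
only when each delegate adds new data and leaves existing data untouched. The
names Span, Delegate and runInParallel are hypothetical and exist only for
this sketch; a thread pool stands in for the remote processes.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

final class ParallelDelegatesSketch {

    /** Hypothetical stand-in for an annotation: a type name plus a character span. */
    static final class Span {
        final String type;
        final int begin;
        final int end;

        Span(String type, int begin, int end) {
            this.type = type;
            this.begin = begin;
            this.end = end;
        }

        @Override
        public String toString() {
            return type + "[" + begin + "," + end + "]";
        }
    }

    /** Hypothetical stand-in for a delegate: reads the input, returns only NEW spans. */
    interface Delegate {
        List<Span> process(String text);
    }

    /** Runs all delegates concurrently on the same immutable input and merges their additions. */
    static List<Span> runInParallel(String text, List<Delegate> delegates) {
        ExecutorService pool = Executors.newFixedThreadPool(Math.max(1, delegates.size()));
        try {
            List<Callable<List<Span>>> tasks = new ArrayList<>();
            for (Delegate d : delegates) {
                tasks.add(() -> d.process(text));
            }
            List<Span> merged = new ArrayList<>();
            // invokeAll blocks until every task has finished, so the caller
            // waits for the slowest delegate. Futures come back in task order.
            for (Future<List<Span>> f : pool.invokeAll(tasks)) {
                merged.addAll(f.get());
            }
            return merged;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting for delegates", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("a delegate failed", e.getCause());
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        List<Delegate> delegates = List.of(
                text -> List.of(new Span("Word", 0, 4)),
                text -> List.of(new Span("Number", 5, 7)));
        System.out.println(runInParallel("Room 42", delegates));
    }
}
```

Because no delegate touches what the others see, merging is just a
concatenation of the additions. Once a delegate edits shared, preexisting
data, there is no such simple merge.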

As Jens suggests, the resulting latency improvement depends on the remoting
overhead relative to the processing time. The overall latency can also be no
lower than that of the slowest parallel delegate.
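
A rough way to reason about this trade-off is sketched below. It is a
simplified model, not a measurement and not part of UIMA-AS. It assumes that
every parallel delegate pays the same fixed remoting overhead and it ignores
the cost of merging results. The class name, method names and numbers are
made up for illustration.

```java
import java.util.List;

/** Back-of-the-envelope latency model for sequential vs. parallel delegates. */
final class LatencyEstimate {

    /** Colocated, sequential pipeline: processing times simply add up. */
    static long sequentialMillis(List<Long> processingMillis) {
        long total = 0;
        for (long t : processingMillis) {
            total += t;
        }
        return total;
    }

    /**
     * Parallel remote delegates: the aggregate waits for the slowest one,
     * and each delegate pays the (assumed constant) remoting overhead.
     */
    static long parallelMillis(List<Long> processingMillis, long remotingOverheadMillis) {
        long slowest = 0;
        for (long t : processingMillis) {
            slowest = Math.max(slowest, t + remotingOverheadMillis);
        }
        return slowest;
    }

    public static void main(String[] args) {
        List<Long> delegates = List.of(120L, 80L, 300L); // made-up processing times
        long overhead = 50L;                              // made-up remoting overhead
        System.out.println("sequential: " + sequentialMillis(delegates) + " ms");
        System.out.println("parallel:   " + parallelMillis(delegates, overhead) + " ms");
    }
}
```

Under this model, parallel execution pays off only when the delegates'
processing times are large compared with the remoting overhead. For fast
delegates (say 10 ms each with 50 ms of overhead), the colocated sequential
pipeline comes out ahead.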

Remoting overhead can be reduced using the binary serialization option, but
then all services must have identical type systems.

Eddie


On Fri, Dec 19, 2014 at 9:10 AM, Silvestre Losada <
silvestre.losada@gmail.com> wrote:

> Hi Jens,
>
> First of all, thanks for your detailed answer. UIMA Ruta has an option to
> execute an analysis engine from within a Ruta script, described here
> <http://goo.gl/ekbhv8>. So inside the script you can execute the analysis
> engine and then apply some rules to the annotations it created. What I want
> is the option to execute the analysis engines in parallel to save time.
> Would that be possible?
>
> Kind regards
>
> On 19 December 2014 at 12:35, Jens Grivolla <j+asf@grivolla.net> wrote:
> >
> > Hi Silvestre,
> >
> > there doesn't seem to be anything Ruta-specific in your question. In
> > principle, UIMA-AS allows parallel scaleout and merges the results (though
> > I personally have never used it this way), but there are of course a few
> > things to take into account.
> >
> > First, you will need to properly define the dependencies between your
> > different analysis engines to ensure you always have all the necessary
> > information available, meaning that you can only run things in parallel
> > that are independent of one another. Then you will have to see whether the
> > overhead of distributing your CAS to several engines running in parallel
> > and then merging the results is greater than that of a single colocated
> > pipeline, which can pass the information more efficiently. I guess you'll
> > have to benchmark your specific application, but maybe somebody with more
> > experience can give you some general directions...
> >
> > Best,
> > Jens
> >
> > On Thu, Dec 18, 2014 at 12:26 PM, Silvestre Losada <
> > silvestre.losada@gmail.com> wrote:
> > >
> > > Well, let me explain.
> > >
> > > Ruta scripts are really good for working over the output of analysis
> > > engines: each analysis engine does some atomic piece of work, and with
> > > Ruta rules you can easily work over the generated annotations, combine
> > > them, remove them... What I need is to execute several analysis engines
> > > in parallel to improve the response time. Right now the analysis engines
> > > are executed sequentially; I want to execute them in parallel, then take
> > > the output of all of them and apply some Ruta rules to it.
> > >
> > > Would that be possible?
> > >
> > > On 17 December 2014 at 18:13, Peter Klügl <pkluegl@uni-wuerzburg.de>
> > > wrote:
> > > >
> > > > Hi,
> > > >
> > > > I haven't used UIMA-AS (with ruta) in a real application yet, but I
> > > > tested it once for an rc. Did you face any problems?
> > > >
> > > > Best
> > > >
> > > > Peter
> > > >
> > > > On 17.12.2014 at 14:34, Silvestre Losada wrote:
> > > > > Hi All,
> > > > >
> > > > > Is there any way to execute Ruta scripts in parallel using the
> > > > > UIMA-AS approach? If so, could you provide an example?
> > > > >
> > > > > Kind regards.
> > > > >
> > > >
> > > >
> > >
> >
>
