uima-user mailing list archives

From Diego Buoro <jklpo...@gmail.com>
Subject Re: Marking consecutive tokens with RUTA
Date Wed, 27 May 2015 17:22:31 GMT
Hi Peter!
We call the script with the following lines:

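// Load the RUTA script from the classpath (Resources and Charsets are from Guava).
URL url = Resources.getResource("Main.ruta");
String text = Resources.toString(url, Charsets.UTF_8);

// Build the analysis engine from the script text and our type system description (tsd).
AnalysisEngineDescription aeDes =
    Ruta.createAnalysisEngineDescription(text, tsd);
this.ae = UIMAFramework.produceAnalysisEngine(aeDes);

// Create a CAS and fill it with the sentence text and our annotations.
CAS cas = ae.newCAS();
converter.populateCas(sentence.getTextSentence(), cas);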

The populateCas method is responsible for translating our annotations into
RUTA annotations, but it doesn't set any type priorities explicitly.
We don't know much about type priorities; the RUTA references we found say
very little about them. Are they necessary for what we need to do?
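
To make the result we expect concrete, here is a minimal plain-Java sketch of
what we think "Token Token {-> Problem}" should produce: one Problem span for
each pair of neighbouring Tokens. It does not use UIMA or RUTA at all. The
names ConsecutiveTokens, Span, tokenize and markPairs are hypothetical, and it
assumes that the rule is tried at every Token and that whitespace between
Tokens is skipped, which is our reading and not confirmed behaviour.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for annotations; this is not the UIMA or RUTA API.
final class ConsecutiveTokens {

    /** A half-open character span [begin, end) over the document text. */
    static final class Span {
        final int begin;
        final int end;

        Span(int begin, int end) {
            this.begin = begin;
            this.end = end;
        }
    }

    /** Marks every maximal run of non-whitespace characters as a token. */
    static List<Span> tokenize(String text) {
        List<Span> tokens = new ArrayList<>();
        int i = 0;
        while (i < text.length()) {
            if (Character.isWhitespace(text.charAt(i))) {
                i++;
                continue;
            }
            int start = i;
            while (i < text.length() && !Character.isWhitespace(text.charAt(i))) {
                i++;
            }
            tokens.add(new Span(start, i));
        }
        return tokens;
    }

    /** Returns one span per pair of neighbouring tokens (the assumed expected result). */
    static List<Span> markPairs(List<Span> tokens) {
        List<Span> problems = new ArrayList<>();
        for (int i = 0; i + 1 < tokens.size(); i++) {
            // The span runs from the start of the first token to the end of the second.
            problems.add(new Span(tokens.get(i).begin, tokens.get(i + 1).end));
        }
        return problems;
    }
}
```

Under these assumptions, a sentence with n Tokens would give n - 1 Problems,
whereas our actual script currently gives none.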

The file that contains the above lines is available here:
The processCAS method is available here:
The script we are calling is available here:

PS: Yes, we remembered the semicolons.

Thanks for the help :)

2015-05-26 15:30 GMT-03:00 Diego Buoro <jklports@gmail.com>:

> I think I wasn't clear enough, so I should be more specific.
> I have a type system in which all words have been annotated as Tokens. I
> am calling a RUTA script from a Java class, and that script has only one
> rule:
> Token Token {-> Problem}
> However, with this script, no Problems are created. When I try
> Token {-> Problem}
> I get one Problem for each Token, which is what I expected. Why can't I
> create annotations using rules with more than one word?
> Thanks
> 2015-05-26 14:49 GMT-03:00 Diego Buoro <jklports@gmail.com>:
>> Hello guys, how are you doing?
>> I would like to know, once I have called RUTA from a Java project, how
>> I can mark consecutive tokens as a "Problem" (the name of my annotation, in
>> this case)?
>> Thanks in advance!
