uima-user mailing list archives

From Peter Klügl <peter.klu...@averbis.com>
Subject Re: Marking consecutive tokens with RUTA
Date Thu, 28 May 2015 16:53:16 GMT
Hi,

Sorry, I haven't found the time to take a closer look yet, but I will
do so at the weekend.

Best,

Peter

On 27.05.2015 at 19:22, Diego Buoro wrote:
> Hi Peter!
> We call the script with the following lines:
>
> URL url = Resources.getResource("Main.ruta");
> String text = Resources.toString(url, Charsets.UTF_8);
> AnalysisEngineDescription aeDes =
>     Ruta.createAnalysisEngineDescription(text, tsd);
> this.ae = UIMAFramework.produceAnalysisEngine(aeDes);
>
> CAS cas = ae.newCAS();
> converter.populateCas(sentence.getTextSentence(), cas);
> ae.process(cas);
>
> The populateCas method is responsible for translating our annotations into
> RUTA annotations, but it doesn't set any type priorities explicitly.
> We don't know much about type priorities; the RUTA references we found say
> very little about them. Are they necessary for what we need?
>
> The file that contains the above lines is available here:
> https://github.com/Fichberg/cogroo4/blob/labXP215_Will/cogroo-gc/src/main/java/org/cogroo/tools/checker/checkers/UIMAChecker.java
> The processCAS method is available here:
> https://github.com/Fichberg/cogroo4/blob/labXP215_Will/cogroo-gc/src/main/java/org/cogroo/tools/checker/checkers/uima/UimaCasAdapter.java
> The script we are calling is available here:
> https://github.com/Fichberg/cogroo4/blob/labXP215_Will/cogroo-ruta/script/Main.ruta
>
> PS: Yes, we remembered the semicolons.
>
> Thanks for the help :)
>
>
>
> 2015-05-26 15:30 GMT-03:00 Diego Buoro <jklports@gmail.com>:
>
>> I think I wasn't clear enough, so I should be more specific.
>>
>> I have a type system in which all words have been annotated as Tokens. I
>> am calling a RUTA script from a Java class, and that script has only one
>> rule:
>> Token Token {-> Problem}
>>
>> However, with this script, no Problems are created. When I try
>> Token {-> Problem}
>>
>> I get one Problem for each Token, which is what I expected. Why can't I
>> create annotations using rules that match more than one word?
>>
>> Thanks
>>
>>
>>
>>
>> 2015-05-26 14:49 GMT-03:00 Diego Buoro <jklports@gmail.com>:
>>
>>> Hello guys, how are you doing?
>>>
>>> I would like to know, once I have called RUTA from a Java project, how I
>>> can mark consecutive tokens as a "Problem" (the name of my annotation, in
>>> this case)?
>>>
>>> Thanks in advance!
>>>
>>

