uima-user mailing list archives

From Peter Klügl <pklu...@uni-wuerzburg.de>
Subject Re: Marking consecutive tokens with RUTA
Date Sun, 31 May 2015 13:51:52 GMT
... I forgot to mention one problem you will run into when you use the 
scripts with the upcoming Ruta release: the package declaration does not 
match the package structure the scripts are located in. With Ruta 2.2.1 
this is not really problematic, but it will cause errors with Ruta 2.3.0.
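
To illustrate the kind of mismatch meant here, the following is a minimal
Java sketch, not part of Ruta itself, that compares a script's PACKAGE
declaration with the folder the script lives in. The class name
RutaPackageCheck, its methods, the package org.example and the script/ root
folder are all assumptions made for this example, and the regular expression
only covers a simple `PACKAGE a.b.c;` line.

```java
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper (not a Ruta API): checks a PACKAGE line against the script's folder.
public class RutaPackageCheck {

    // Matches a declaration such as "PACKAGE org.example;" at the start of a line.
    private static final Pattern PACKAGE_DECL =
            Pattern.compile("(?m)^\\s*PACKAGE\\s+([A-Za-z_][\\w.]*)\\s*;");

    /** Returns the declared package, or empty if the script has no PACKAGE line. */
    public static Optional<String> declaredPackage(String scriptText) {
        Matcher m = PACKAGE_DECL.matcher(scriptText);
        return m.find() ? Optional.of(m.group(1)) : Optional.empty();
    }

    /** Derives the package implied by the script's folder, relative to the script root. */
    public static String packageFromPath(Path scriptRoot, Path scriptFile) {
        Path parent = scriptRoot.relativize(scriptFile).getParent();
        if (parent == null) {
            return ""; // script sits directly in the root folder
        }
        StringBuilder sb = new StringBuilder();
        for (Path part : parent) {
            if (sb.length() > 0) {
                sb.append('.');
            }
            sb.append(part.toString());
        }
        return sb.toString();
    }

    /** True if the declared package equals the package implied by the folder structure. */
    public static boolean matches(String scriptText, Path scriptRoot, Path scriptFile) {
        return declaredPackage(scriptText).orElse("")
                .equals(packageFromPath(scriptRoot, scriptFile));
    }

    public static void main(String[] args) {
        Path root = Paths.get("script"); // assumed script root folder
        String text = "PACKAGE org.example;\nDECLARE Problem;\n";
        // Script stored in script/org/example/: declaration and folder agree.
        System.out.println(matches(text, root, Paths.get("script", "org", "example", "Main.ruta")));
        // Script stored directly in script/: declaration and folder disagree.
        System.out.println(matches(text, root, Paths.get("script", "Main.ruta")));
    }
}
```

With these assumed paths the first call prints true and the second prints
false; the second case corresponds to the situation described above.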

Best,

Peter


On 31.05.2015 at 15:28, Peter Klügl wrote:
> Hi,
>
> I looked at the code, but I haven't found anything that could cause 
> the problem. The type priorities should be fine when using the 
> BasicEngine.xml of the Maven dependency.
>
> Normally, I would assume that it's caused by the visibility, e.g., an 
> annotation starts with something invisible and thus is not matched. Or 
> the filtering settings are changed and the rule expects a token but 
> finds a SPACE. I tested Main.ruta and observed no problems.
>
> Can you provide a minimal example where I can reproduce the problem?
>
> Best,
>
> Peter
>
>
> On 28.05.2015 at 18:53, Peter Klügl wrote:
>> Hi,
>>
>> sorry, I haven't found the time to take a closer look yet, but I will
>> at the weekend.
>>
>> Best,
>>
>> Peter
>>
>> On 27.05.2015 at 19:22, Diego Buoro wrote:
>>> Hi Peter!
>>> We call the script with the following lines:
>>>
>>> URL url = Resources.getResource("Main.ruta");
>>> String text = Resources.toString(url, Charsets.UTF_8);
>>> AnalysisEngineDescription aeDes =
>>>     Ruta.createAnalysisEngineDescription(text, tsd);
>>> this.ae = UIMAFramework.produceAnalysisEngine(aeDes);
>>>
>>> CAS cas = ae.newCAS();
>>> converter.populateCas(sentence.getTextSentence(), cas);
>>> ae.process(cas);
>>>
>>> The populateCas method is responsible for translating our annotations into
>>> RUTA annotations, but it doesn't set any type priority explicitly.
>>> We don't know much about type priorities; the RUTA references we found say
>>> very little about them. Are they necessary for what we need?
>>>
>>> The file that contains the above lines is available here:
>>> https://github.com/Fichberg/cogroo4/blob/labXP215_Will/cogroo-gc/src/main/java/org/cogroo/tools/checker/checkers/UIMAChecker.java
>>>
>>> The processCAS method is available here:
>>> https://github.com/Fichberg/cogroo4/blob/labXP215_Will/cogroo-gc/src/main/java/org/cogroo/tools/checker/checkers/uima/UimaCasAdapter.java
>>>
>>> The script we are calling is available here:
>>> https://github.com/Fichberg/cogroo4/blob/labXP215_Will/cogroo-ruta/script/Main.ruta
>>>
>>>
>>> PS: Yes, we remembered the semicolons.
>>>
>>> Thanks for the help :)
>>>
>>>
>>>
>>> 2015-05-26 15:30 GMT-03:00 Diego Buoro <jklports@gmail.com>:
>>>
>>>> I think I wasn't clear enough, so I should be more specific.
>>>>
>>>> I have a type system in which all words have been annotated as Tokens. I
>>>> am calling a RUTA script from a Java class, and that script has only one
>>>> rule:
>>>> Token Token {-> Problem}
>>>>
>>>> However, with this script, no Problems are created. When I try
>>>> Token {-> Problem}
>>>>
>>>> I get one Problem for each Token, which is what I expected. Why can't I
>>>> create annotations using rules with more than one word?
>>>>
>>>> Thanks
>>>>
>>>>
>>>>
>>>>
>>>> 2015-05-26 14:49 GMT-03:00 Diego Buoro <jklports@gmail.com>:
>>>>
>>>>> Hello guys, how are you doing?
>>>>>
>>>>> I would like to know: once I have called RUTA from a Java project, how can
>>>>> I mark consecutive tokens as a "Problem" (the name of my annotation, in
>>>>> this case)?
>>>>>
>>>>> Thanks in advance!
>>>>>
>
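
The visibility explanation in the quoted reply (a rule that expects a token
but finds a SPACE) can be pictured with a toy model in plain Java. This is
only a sketch of the idea and not how Ruta is implemented: the class
VisibilityToyModel, the Span type and countTokenPairs are hypothetical names,
and annotations are assumed to be sorted by offset and non-overlapping.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Set;

// Toy model only (not Ruta code): how filtered types affect a "Token Token" rule.
public class VisibilityToyModel {

    /** Minimal stand-in for an annotation: a type name plus character offsets. */
    public static final class Span {
        final String type;
        final int begin;
        final int end;

        public Span(String type, int begin, int end) {
            this.type = type;
            this.begin = begin;
            this.end = end;
        }
    }

    /**
     * Counts positions where a Token is directly followed by another Token,
     * after dropping every annotation whose type is filtered (invisible).
     * Assumes spans are sorted by begin offset and do not overlap.
     */
    public static int countTokenPairs(List<Span> spans, Set<String> filteredTypes) {
        List<Span> visible = new ArrayList<>();
        for (Span s : spans) {
            if (!filteredTypes.contains(s.type)) {
                visible.add(s);
            }
        }
        int count = 0;
        for (int i = 0; i + 1 < visible.size(); i++) {
            boolean first = visible.get(i).type.equals("Token");
            boolean second = visible.get(i + 1).type.equals("Token");
            if (first && second) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        // Text "ab cd": two tokens separated by one whitespace annotation.
        List<Span> spans = Arrays.asList(
                new Span("Token", 0, 2),
                new Span("SPACE", 2, 3),
                new Span("Token", 3, 5));

        // SPACE is filtered: the two tokens count as consecutive.
        System.out.println(countTokenPairs(spans, Collections.singleton("SPACE")));
        // Nothing is filtered: the SPACE sits between the tokens, so no pair is found.
        System.out.println(countTokenPairs(spans, Collections.<String>emptySet()));
    }
}
```

In this model the first call prints 1 and the second prints 0, which mirrors
the symptom in the thread: the single-element rule still matches every Token,
while the two-element rule finds nothing once whitespace is visible.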

