opennlp-dev mailing list archives

From Jörn Kottmann <kottm...@gmail.com>
Subject Re: OpenNLP Annotations Proposal
Date Fri, 24 Jun 2011 14:54:34 GMT
We should have a look at Phrase Detectives as an example:
http://anawiki.essex.ac.uk/phrasedetectives/

They have a short tutorial/test that each user has to complete, they have a
leaderboard, and the progress made by each user is tracked.

Jörn

On 6/24/11 1:47 PM, Jörn Kottmann wrote:
> On 6/24/11 1:38 PM, Hannes Korte wrote:
>> On 06/24/2011 11:54 AM, Olivier Grisel wrote:
>>> 2011/6/24 Hannes Korte<hannes.korte@iais.fraunhofer.de>:
>>>> On 24.06.2011 10:03, Jörn Kottmann wrote:
>>>>> Hannes and Olivier, do you want to take over the part about the web-based
>>>>> annotation tooling? For now I called it Corpus Refiner, but we can of
>>>>> course change the name to something else.
>>>> Yes, I'll try to find some time in the next few days to have a look at
>>>> what Olivier already committed and to work on the JavaScript part of the
>>>> web GUI.
>>> My stuff is not following the new design: in particular it treats
>>> sentences as individual sentences. Maybe you should go on from the
>>> existing Walter design, which instead treats each CAS as an individual,
>>> multi-sentence document, and try to align it with the tooling available in
>>> opennlp-uima. But we need to agree on a CAS type system first. I don't
>>> know opennlp-uima myself and won't have time to invest more effort
>>> in this project before mid-July, unfortunately.
>>>
>> OK, until then I'll work with some dummy documents. By the way, who wants
>> to build the active learning component that decides which sentences
>> come next?
> Would it be possible for you to contribute the Walter code to OpenNLP?
>
> In a previous project I used a kind of filtering to find CASes that should
> be annotated. Maybe that approach could work well for us here too.
> The corpus server will index all CASes with their annotations, and the
> Corpus Refiner or Walter server can then query the index to find CASes it
> should hand out to annotators. Of course, this logic would be task dependent.
>
> This could also be done in a more controlled way, where we insert
> annotations into the CAS that mark an area to be labeled manually.
>
> Let's update the OpenNLP Annotations proposal a little to describe
> these things.
>
> Jörn
>
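The index-based filtering described in the quoted message could be sketched roughly as follows. This is only a minimal Python illustration under stated assumptions, not OpenNLP or UIMA code: the `CorpusIndex` class, its `add` and `find_candidates` methods, the document ids, and the annotation type names (including the `NeedsManualLabel` marker) are all hypothetical. A real corpus server would index actual CASes rather than sets of type names.

```python
from collections import defaultdict


class CorpusIndex:
    """Hypothetical in-memory index: annotation type -> ids of CASes containing it."""

    def __init__(self):
        self._by_type = defaultdict(set)
        self._all_ids = set()

    def add(self, cas_id, annotation_types):
        """Index one CAS under every annotation type it contains."""
        self._all_ids.add(cas_id)
        for ann_type in annotation_types:
            self._by_type[ann_type].add(cas_id)

    def find_candidates(self, required=(), missing=()):
        """Return sorted ids of CASes with all `required` types and none of the `missing` types."""
        result = set(self._all_ids)
        for ann_type in required:
            result &= self._by_type.get(ann_type, set())
        for ann_type in missing:
            result -= self._by_type.get(ann_type, set())
        return sorted(result)


# Illustrative data only; ids and type names are made up.
index = CorpusIndex()
index.add("doc-1", {"Sentence", "Token", "PersonName"})
index.add("doc-2", {"Sentence", "Token"})
index.add("doc-3", {"Sentence", "Token", "NeedsManualLabel"})

# Task-dependent query: tokenized documents that have no name annotations yet.
unlabeled = index.find_candidates(required=["Token"], missing=["PersonName"])

# The "more controlled" variant: a marker annotation flags what to label manually.
flagged = index.find_candidates(required=["NeedsManualLabel"])
```

The two queries correspond to the two ideas in the message: selecting CASes by which annotations they already carry, and selecting CASes that were explicitly marked for manual labeling. Which types to require or exclude would depend on the annotation task.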

