opennlp-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jörn Kottmann <kottm...@gmail.com>
Subject Re: universal tagset
Date Fri, 15 Apr 2011 14:05:23 GMT
On 4/14/11 5:15 AM, Jason Baldridge wrote:
> For many applications, it would be useful to have a universal tagset for any
> language you are working with. See below for details on a project that
> provides mappings from many standard treebanks to a course-grained tagset
> (12 tags). We might want to support these mappings to simple tags in our
> models (e.g. have a model that uses corpus-native tags and another that uses
> universal tags).

All we need to do is, to replace the pos-tags in the training data with the
universal one, right? I guess we could add support for this to the
converter tool or make a small tool which can replace the tags.

Jörn

Mime
View raw message