opennlp-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jörn Kottmann (JIRA) <>
Subject [jira] Commented: (OPENNLP-15) Add support for the CoNLL 03 data format
Date Thu, 16 Dec 2010 12:09:00 GMT


Jörn Kottmann commented on OPENNLP-15:

The handling of the file encoding must be improved. Currently the UTF-8 encoding is hardcoded
for both generated input files, german and english.
The input files are generated by conll03 scripts. The input encoding might be defined by the

conll03 scripts (must be checked). In the case that is true the encoding could be hardcoded,
otherwise it should be configurable.

If the encoding is hardcoded, the cmd line interface must be improved to not accept an encoding
which is not used.

> Add support for the CoNLL 03 data format
> ----------------------------------------
>                 Key: OPENNLP-15
>                 URL:
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Name Finder
>    Affects Versions: tools-1.5.0-sourceforge
>            Reporter: Jörn Kottmann
>             Fix For: tools-1.5.1-incubating
> Adding support to convert CoNLL 03 Reurters Support to NameFinder.
> Work on this issue began over at sourceforge:

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message