commons-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Bruno P. Kinoshita (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (TEXT-71) Add a LanguageCode converter
Date Sat, 04 Mar 2017 07:40:45 GMT

     [ https://issues.apache.org/jira/browse/TEXT-71?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Bruno P. Kinoshita updated TEXT-71:
-----------------------------------
    Description: 
For investigation. I am not aware of a converter for different language code standards such
as ISO-639-1, ISO-639-2, and ISO-639-3.

If you use OpenNLP, for instance, it expects the language codes in ISO-639-3. But if you have
language codes such as "en", then you will have to add a converter to "[eng|http://www-01.sil.org/iso639-3/documentation.asp?id=eng]".

This might be moved to [lang] if that makes more sense.

Useful resources for this issue:

* http://data.okfn.org/data/core/language-codes
* http://site.icu-project.org/

  was:
For investigation. I am not aware of a converter for different language code standards such
as ISO-639-1, ISO-639-2, and ISO-639-3.

If you use OpenNLP, for instance, it expects the language codes in ISO-639-3. But if you have
language codes such as "en", then you will have to add a converter to "[eng|http://www-01.sil.org/iso639-3/documentation.asp?id=eng]".

This might be moved to [lang] if that makes more sense.

Useful resources for this issue:

* http://data.okfn.org/data/core/language-codes


> Add a LanguageCode converter
> ----------------------------
>
>                 Key: TEXT-71
>                 URL: https://issues.apache.org/jira/browse/TEXT-71
>             Project: Commons Text
>          Issue Type: Bug
>            Reporter: Bruno P. Kinoshita
>            Priority: Minor
>              Labels: feedback, review
>
> For investigation. I am not aware of a converter for different language code standards
such as ISO-639-1, ISO-639-2, and ISO-639-3.
> If you use OpenNLP, for instance, it expects the language codes in ISO-639-3. But if
you have language codes such as "en", then you will have to add a converter to "[eng|http://www-01.sil.org/iso639-3/documentation.asp?id=eng]".
> This might be moved to [lang] if that makes more sense.
> Useful resources for this issue:
> * http://data.okfn.org/data/core/language-codes
> * http://site.icu-project.org/



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message