From: Oleg Tikhonov
To: opennlp-dev@incubator.apache.org, jbaldrid@mail.utexas.edu
Date: Tue, 17 May 2011 21:40:09 +0300
Subject: Re: switch to ISO 639-2 codes for languages?

My two cents: tesseract-ocr also uses ISO 639-3, and it would be great for
those who build solutions such as OpenNLP + tesseract.

-Oleg

On Tue, May 17, 2011 at 9:33 PM, Jason Baldridge wrote:

> I think we should change to the three character convention for language
> specific materials, e.g. "eng" rather than "en" for English.
>
> http://en.wikipedia.org/wiki/List_of_ISO_639-2_codes
>
> Do others agree?
>
> --
> Jason Baldridge
> Assistant Professor, Department of Linguistics
> The University of Texas at Austin
> http://www.jasonbaldridge.com
> http://twitter.com/jasonbaldridge