Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 58086 invoked from network); 22 Aug 2005 03:52:27 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 22 Aug 2005 03:52:27 -0000 Received: (qmail 97316 invoked by uid 500); 22 Aug 2005 03:52:23 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 97292 invoked by uid 500); 22 Aug 2005 03:52:23 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 97279 invoked by uid 99); 22 Aug 2005 03:52:23 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 21 Aug 2005 20:52:23 -0700 X-ASF-Spam-Status: No, hits=0.4 required=10.0 tests=DNS_FROM_RFC_ABUSE X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: local policy) Received: from [68.142.200.34] (HELO web31101.mail.mud.yahoo.com) (68.142.200.34) by apache.org (qpsmtpd/0.29) with SMTP; Sun, 21 Aug 2005 20:52:40 -0700 Received: (qmail 8201 invoked by uid 60001); 22 Aug 2005 03:52:20 -0000 DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=Message-ID:Received:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type:Content-Transfer-Encoding; b=Yi+KA+rwt6/FN/xDSkeN/wm1eBQe3GRHNtHg+bpBV3yG3hf9raNPr/CUEuOgHqc60yNsp0FymLEUJu2twi2ZSf54AFhV5Vgd1DQckm/2a0ngopS5DdXalXo1TtLlplHr8JYspLsoGS2GUkqwkMqHmmMOQgQejMw7qOC2JavFwEM= ; Message-ID: <20050822035220.8199.qmail@web31101.mail.mud.yahoo.com> Received: from [69.201.142.21] by web31101.mail.mud.yahoo.com via HTTP; Sun, 21 Aug 2005 20:52:19 PDT Date: Sun, 21 Aug 2005 20:52:19 -0700 (PDT) From: Otis Gospodnetic Subject: Re: NGram Language Categorization Source To: java-user@lucene.apache.org In-Reply-To: <30c6373b050821134560a3bb11@mail.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N Hello, Sounds like that LI acronym was confusing -Language Identification. Otis > > It was > > also found that the way you create ngram profiles (e.g. with or > without > > surrounding spaces, single length or mixed length) affects the LI > > performance. > > LI??? > > I haven't benchmarked it but I'd be interested in any suggestions you > have. > > > So, there is still a lot to do in this area, if you come up with > some > > unique way of improving LI performance... > > Maybe I'm being dense but what is LI performance? --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org