Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 78602 invoked from network); 26 Oct 2006 05:29:18 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 26 Oct 2006 05:29:18 -0000 Received: (qmail 23343 invoked by uid 500); 25 Oct 2006 20:17:25 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 23306 invoked by uid 500); 25 Oct 2006 20:17:24 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 23295 invoked by uid 99); 25 Oct 2006 20:17:24 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Oct 2006 13:17:24 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: pass (herse.apache.org: local policy) Received: from [64.90.160.18] (HELO server1.threattracker.com) (64.90.160.18) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Oct 2006 13:17:12 -0700 Received: from [192.168.1.99] (gate.marathonconsulting.com [69.38.225.18]) (authenticated) by server1.threattracker.com (8.11.6/8.11.6) with ESMTP id k9PKH0c23998 for ; Wed, 25 Oct 2006 16:17:01 -0400 Message-ID: <453FC61F.5010505@alias-i.com> Date: Wed, 25 Oct 2006 16:16:31 -0400 From: Breck Baldwin User-Agent: Mozilla Thunderbird 1.0.6 (X11/20050716) X-Accept-Language: en-us, en MIME-Version: 1.0 To: java-user@lucene.apache.org Subject: Re: experiences with lingpipe References: <453C7D4F.2090502@uni-hd.de> <453D5740.4050005@alias-i.com> <453F86F0.5060101@uni-hd.de> In-Reply-To: <453F86F0.5060101@uni-hd.de> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Martin Braun wrote: > Hi Breck, > > thanks for your answer. > > >>>What about performance? >> >>Tuning params dominate the performance space. A small beam (16 active >>hypotheses) will be quite snappy (I have 200 queries/sec with a 32 beam. >>over a 80 gig text collection that with some pruning was 5 gig in memory >>running an 8 gram model) >> > > > That's really impressive (though I didn't understand what you mean with > "beams"). Beam is how many active spellings your search space has. So 'brek' with a beam of 3 would retain 3 different spelling variations as a result of the best scoreing edits against the underlyinig language model--it is maintained in a left to right scan of the word. > > Did I unterstand the license term correctly, that I could use Lingpipe > for free when I am building a Search Engine for a Academic Website (for > free use)? Yep. best breck > > thanks, > martin > > >>Tuning is a big deal and I need to write a tuning tutorial. I am doing >>more teaching/training now so that may happen. >> >> >>breck >> >> >>> >>>Does anybody have a good idea how to find typos in the index. >>> >>>tia, >>>martin >>> >>> >>>--------------------------------------------------------------------- >>>To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org >>>For additional commands, e-mail: java-user-help@lucene.apache.org >> >>--------------------------------------------------------------------- >>To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org >>For additional commands, e-mail: java-user-help@lucene.apache.org >> > > > -- Breck Baldwin Alias-i, Inc. 181 North 11th Street, Suite 401 Brooklyn, NY 11211 v:718.290.9170 f:718.290.9171 m:917.292.8845 breck@alias-i.com --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org