Message-ID: <20041126170213.67122.qmail@web12703.mail.yahoo.com>
Date: Fri, 26 Nov 2004 09:02:13 -0800 (PST)
From: Otis Gospodnetic
Subject: Re: Dutch Analyzer dictionary format?
To: Lucene Users List
In-Reply-To: <6.1.2.0.2.20041126100835.0666ca30@pop.xs4all.nl>

Judging from everything you've said, the answer is yes. I don't use
Dutch Analyzer, so I'm not 100% sure about this, but it sounds easy
enough to try.

Otis

--- Twan Kogels wrote:

> Hello all,
>
> I'm using Lucene to search through a couple of documents to find
> interesting documents. Most documents are in Dutch. I saw that the
> default snowball stemmer wasn't doing well on text written in a
> foreign language. Luckily I found a Dutch text analyzer in the
> Lucene sandbox project.
>
> I've read the javadoc and found out it needs a stem dictionary. You
> can load this dictionary with the following function:
> DutchAnalyzer.setStemDictionary(File f)
>
> The format needs to be a tab-separated list (word [tab] stem).
>
> To be sure I do everything correctly, I have a question about the
> dictionary:
> Can I just get:
>
> and convert it to a tab-separated list and then "feed" it to the
> setStemDictionary() function?
>
> Kind regards,
> Twan Kogels
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
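
For reference, the "word [tab] stem" format described in the quoted message could be produced along these lines. This is only a rough sketch, not something taken from the DutchAnalyzer sources: the class name StemDictionaryFile, the sample word/stem pairs, and the UTF-8 encoding are all assumptions made for illustration, and the encoding DutchAnalyzer actually expects should be checked against its javadoc.

```java
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper: writes and re-reads a "word<TAB>stem" file.
public class StemDictionaryFile {

    // Writes one "word<TAB>stem" pair per line (UTF-8 is an assumption).
    public static void write(Map<String, String> stems, Path out) throws IOException {
        try (BufferedWriter w = Files.newBufferedWriter(out, StandardCharsets.UTF_8)) {
            for (Map.Entry<String, String> e : stems.entrySet()) {
                w.write(e.getKey());
                w.write('\t');
                w.write(e.getValue());
                w.newLine();
            }
        }
    }

    // Reads the file back, skipping lines without exactly two tab-separated fields.
    public static Map<String, String> read(Path in) throws IOException {
        Map<String, String> stems = new LinkedHashMap<>();
        try (BufferedReader r = Files.newBufferedReader(in, StandardCharsets.UTF_8)) {
            String line;
            while ((line = r.readLine()) != null) {
                String[] parts = line.split("\t");
                if (parts.length == 2) {
                    stems.put(parts[0], parts[1]);
                }
            }
        }
        return stems;
    }

    public static void main(String[] args) throws IOException {
        // Illustrative sample pairs only; a real dictionary would come from a word list.
        Map<String, String> stems = new LinkedHashMap<>();
        stems.put("fietsen", "fiets");
        stems.put("boeken", "boek");

        Path file = Files.createTempFile("stemdict", ".txt");
        write(stems, file);
        System.out.println(read(file));

        // With Lucene's sandbox DutchAnalyzer on the classpath, the file would then
        // be passed to the method named in the quoted message:
        //   analyzer.setStemDictionary(file.toFile());
    }
}
```

Reading the file back before handing it to the analyzer is a cheap way to confirm that every line really has exactly one tab between word and stem.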