Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 17091 invoked from network); 13 Apr 2011 08:51:45 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 13 Apr 2011 08:51:45 -0000 Received: (qmail 42784 invoked by uid 500); 13 Apr 2011 08:51:43 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 42731 invoked by uid 500); 13 Apr 2011 08:51:42 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 42702 invoked by uid 99); 13 Apr 2011 08:51:39 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Apr 2011 08:51:39 +0000 X-ASF-Spam-Status: No, hits=2.8 required=5.0 tests=FREEMAIL_FROM,FREEMAIL_REPLYTO,RCVD_IN_DNSWL_NONE,RFC_ABUSE_POST,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of simon.willnauer@googlemail.com designates 209.85.220.176 as permitted sender) Received: from [209.85.220.176] (HELO mail-vx0-f176.google.com) (209.85.220.176) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Apr 2011 08:51:33 +0000 Received: by vxa37 with SMTP id 37so428479vxa.35 for ; Wed, 13 Apr 2011 01:51:12 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=gamma; h=domainkey-signature:mime-version:reply-to:in-reply-to:references :date:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=yEjspCF2Hw5w+ZY4zwHwif/8/4u3MniLB1dx6/JVFdA=; b=wr6eiuqB29pkQAppIPVld47DyGNKmqdGdWx2uaTIchBgrpZYtem/OiTyrHeWRTJ7Oc k2wZRBCJkth85FRQWr9v4XaM04SMfGpuRifuUpRlljUFjKBbYa7i4dBq3ZR4z4DMkXZC oLHEx9qhqccpzLu0+HoBysEse1aCBAUanE4xg= DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlemail.com; s=gamma; h=mime-version:reply-to:in-reply-to:references:date:message-id :subject:from:to:cc:content-type:content-transfer-encoding; b=YpgwLN3ONOdgucHsKYUm5O4FKzd6niNfGH4aUlEX5uiX9HCpsUa3MWpATbUk92ntmK tiOfBoovPsZFe+kPTJhdxzrUbtSHaWux+vFSAq1t+Y8eH7iKnhI55iWtQoPbIOJgNZ7P qvw0HUm76cgBoEkDrnSktOmfazkTbXfeJ2xoE= MIME-Version: 1.0 Received: by 10.52.18.15 with SMTP id s15mr9471710vdd.224.1302684672179; Wed, 13 Apr 2011 01:51:12 -0700 (PDT) Received: by 10.52.165.35 with HTTP; Wed, 13 Apr 2011 01:51:12 -0700 (PDT) Reply-To: simon.willnauer@gmail.com In-Reply-To: References: Date: Wed, 13 Apr 2011 10:51:12 +0200 Message-ID: Subject: Re: German*Filter, Analyzer "cutting" off letters from (french) words... From: Simon Willnauer To: java-user@lucene.apache.org Cc: Clemens Wyss Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org On Wed, Apr 13, 2011 at 9:51 AM, Clemens Wyss wrote: > What I really want to do is ignore german stop words such as "der", "die"= , "das", "ein",... GermanAnalyzer takes a stemExclusionSet if you put those terms into this set the stemmer will not touch them. This should be in 3.1 I think public GermanAnalyzer(Version matchVersion, Set stopwords, Set stemExclusionSet) simon > >> -----Urspr=C3=BCngliche Nachricht----- >> Von: Robert Muir [mailto:rcmuir@gmail.com] >> Gesendet: Dienstag, 12. April 2011 17:03 >> An: java-user@lucene.apache.org >> Betreff: Re: German*Filter, Analyzer "cutting" off letters from (french) >> words... >> >> On Tue, Apr 12, 2011 at 8:46 AM, Clemens Wyss >> wrote: >> > Why so? Where have the e's gone? >> > >> >> the e is being stemmed as its a german suffix... all of the german stemm= ing >> algorithms remove final -e, as do all the french stemming algorithms. >> >> so i don't understand your problem. >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org >> For additional commands, e-mail: java-user-help@lucene.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org