From: Honey George
Date: Tue, 14 Sep 2004 18:57:12 +0100 (BST)
To: lucene-user@jakarta.apache.org
Subject: PorterStemfilter

Hi,

This might be more of a question about the PorterStemmer algorithm than about Lucene, but if anyone has the knowledge, please share.
I am using the PorterStemFilter that comes with Lucene, and it turns out that searching for the word 'printer' does not return a document containing the text 'print'. To narrow down the problem, I tested the PorterStemFilter in a standalone program, and it turns out that the stem of 'printer' is 'printer' and not 'print'. That is, 'printer' is not treated as 'print' + 'er'; the whole word is taken as the stem.

Can somebody explain this behavior?

Thanks & Regards,
George
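
The result described above appears consistent with the published Porter algorithm rather than with a Lucene bug. Step 4 of the algorithm removes the suffix 'er' only when the remaining stem has a measure m greater than 1, where m counts the vowel-consonant sequences in the stem, and 'print' has m = 1. The sketch below is not Lucene's PorterStemmer. The class and method names are hypothetical, and it implements only the measure calculation and that single step 4 rule, so it should be read as an illustration under those assumptions.

```java
public class PorterMeasureSketch {

    // Porter's definition: a letter is a consonant unless it is a, e, i, o, u,
    // or a 'y' that is preceded by a consonant.
    static boolean isConsonant(String w, int i) {
        switch (w.charAt(i)) {
            case 'a': case 'e': case 'i': case 'o': case 'u':
                return false;
            case 'y':
                return i == 0 || !isConsonant(w, i - 1);
            default:
                return true;
        }
    }

    // Measure m of a stem written as [C](VC)^m[V]: the number of times
    // a run of vowels is followed by a consonant.
    static int measure(String stem) {
        int m = 0;
        boolean prevVowel = false;
        for (int i = 0; i < stem.length(); i++) {
            boolean cons = isConsonant(stem, i);
            if (cons && prevVowel) {
                m++;
            }
            prevVowel = !cons;
        }
        return m;
    }

    // Only the step 4 rule "(m > 1) ER -> (nothing)"; every other rule of the
    // algorithm is deliberately left out of this sketch.
    static String stripEr(String word) {
        if (word.endsWith("er")) {
            String stem = word.substring(0, word.length() - 2);
            if (measure(stem) > 1) {
                return stem;
            }
        }
        return word;
    }

    public static void main(String[] args) {
        System.out.println(measure("print"));    // 1
        System.out.println(stripEr("printer"));  // printer
        System.out.println(stripEr("computer")); // comput
    }
}
```

Under this rule 'computer' loses its 'er' because 'comput' has m = 2, while 'printer' is left alone. If 'printer' and 'print' need to match at search time, a different stemmer or a synonym mapping would probably be needed.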