From: Erick Erickson
To: java-user@lucene.apache.org
Date: Tue, 6 Oct 2009 08:09:46 -0400
Subject: Re: Reverse stemmer?
Message-ID: <359a92830910060509t2d19d1c1v7554ced6116f204c@mail.gmail.com>

Why do you care? That is, what problem do you want to solve with a reverse
stemmer?

Note that if you STORE the field, the *original* text is available; storing
and indexing are orthogonal. So if all you want is to get the original text
back, you can freely index with a stemming analyzer and just fetch the stored
data back, which won't be stemmed.

Erick

On Tue, Oct 6, 2009 at 3:31 AM, David Leangen wrote:

> Hello,
>
> I've been using Lucene in a very basic way for some time now, and I'm
> starting to take advantage of some of the linguistic capabilities only now.
>
> I am making use of the snowball analyzer for stemming, and it works very
> well.
>
> Question: is there any such thing as a "reverse stemmer"? In other words,
> given the stem of a word, is there any algorithm to find the original word?
> Or is this just fantasy? ;-)
>
> Now, I understand that there is a 1:n mapping of stems:words. I can deal
> with that.
>
> Thanks!
> =David
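
For what it's worth, the 1:n mapping of stems to words mentioned in the
question can be sketched as a lookup table that is filled in while the text is
being processed: each original word is recorded under its stem, so the surface
forms can be recovered later. This is only a minimal sketch and does not use
any Lucene API. The class name, the method names, and in particular `toyStem`
(a crude suffix stripper standing in for the Snowball stemmer) are assumptions
made up for illustration.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeSet;

public class ReverseStemSketch {

    // Hypothetical toy stemmer: strips one of a few English suffixes.
    // It is a stand-in for a real stemmer such as Snowball, not a copy of it.
    public static String toyStem(String word) {
        String w = word.toLowerCase(Locale.ROOT);
        for (String suffix : new String[] {"ing", "ed", "s"}) {
            // Keep at least three characters of stem after stripping.
            if (w.length() > suffix.length() + 2 && w.endsWith(suffix)) {
                return w.substring(0, w.length() - suffix.length());
            }
        }
        return w;
    }

    // Builds the stem -> original words table (the 1:n mapping).
    public static Map<String, SortedSet<String>> buildReverseIndex(Iterable<String> words) {
        Map<String, SortedSet<String>> reverse = new HashMap<>();
        for (String word : words) {
            String original = word.toLowerCase(Locale.ROOT);
            reverse.computeIfAbsent(toyStem(original), k -> new TreeSet<>()).add(original);
        }
        return reverse;
    }

    public static void main(String[] args) {
        List<String> words = Arrays.asList("walk", "walks", "walked", "walking", "talked");
        Map<String, SortedSet<String>> reverse = buildReverseIndex(words);
        System.out.println(reverse.get("walk")); // prints [walk, walked, walking, walks]
        System.out.println(reverse.get("talk")); // prints [talked]
    }
}
```

Such a table only knows the words it has actually seen, which is the same
trade-off as fetching a stored field: the original text is kept alongside the
stemmed form rather than reconstructed from it.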