From java-user-return-44796-apmail-lucene-java-user-archive=lucene.apache.org@lucene.apache.org Mon Feb 01 09:28:45 2010 Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 45786 invoked from network); 1 Feb 2010 09:28:45 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 1 Feb 2010 09:28:45 -0000 Received: (qmail 11044 invoked by uid 500); 1 Feb 2010 09:28:43 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 10967 invoked by uid 500); 1 Feb 2010 09:28:42 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 10957 invoked by uid 99); 1 Feb 2010 09:28:42 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Feb 2010 09:28:42 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ian.lea@gmail.com designates 209.85.218.209 as permitted sender) Received: from [209.85.218.209] (HELO mail-bw0-f209.google.com) (209.85.218.209) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Feb 2010 09:28:34 +0000 Received: by bwz1 with SMTP id 1so2167192bwz.12 for ; Mon, 01 Feb 2010 01:28:13 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :from:date:message-id:subject:to:content-type :content-transfer-encoding; bh=YTsOZAJU29BczJnAj8onDccJ3evOnxD+OFxyOcqzFy4=; b=xb9tuC2XMaVyRCsZ4WnjiSCpxlNPXLSlMKrDJMW1PXdicFOa8xnkbRXfDvEl2Vo7I4 6rmDlHEQfR6+iXa54QVHHOxzkFuuf5zw3WrMl3BJ297+CB8jNWjFom8MXrlIvZqYVJjN 4uJGNCAAmfyusBLBUkOFOJ3FxxWN+29PAv2xE= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; b=pq66nVlPYFpqA2M+uL0rttRIGVeesrygb/yF7zBt12Swi6lH2+oWUXgMDs0UjQuxEg lQNRJ8HwPllLNdWCgl9rMXfW77t5GMd6XosK3+MPz+x20jRMiaE3RETtuPZUIZguKl63 Ie3Ycx+jT7TwwnzelumqXxWjr1Jog45TkQDdo= MIME-Version: 1.0 Received: by 10.204.20.142 with SMTP id f14mr3013541bkb.64.1265016492849; Mon, 01 Feb 2010 01:28:12 -0800 (PST) In-Reply-To: <27400546.post@talk.nabble.com> References: <27400546.post@talk.nabble.com> From: Ian Lea Date: Mon, 1 Feb 2010 09:27:52 +0000 Message-ID: <8c4e68611002010127u2645cb56v142a28af8671e9ce@mail.gmail.com> Subject: Re: Can't get tokenization/stop works working To: java-user@lucene.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable If you make com a stop word then you won't be able to search for it, but a search for fubar should have worked. Are you sure your analyzer is doing what you want? You don't tell us what analyzer you are using. Tips: use Luke to see what has been indexed read the FAQ entry http://wiki.apache.org/lucene-java/LuceneFAQ#Why_am_I_getting_no_hits_.2BAC= 8_incorrect_hits.3F -- Ian. On Mon, Feb 1, 2010 at 7:25 AM, jchang wrote: > > I want to be able to store a doc with a field with this as a substring: > =A0www.fubar.com > And then I want this document to get returned when I query on > =A0fubar or > =A0fubar.com > > I assume what I should do is make www and com stop words, and make sure t= he > field is tokenized, so it wil break it up along the '.' > > I thought =A0I should take a list of Enlisgh stop words, add in 'www' and= com, > and then make sure the field is tokenized, which I did by using this > constructor: > new Field("name", "value", =A0Field.Store.YES, Field.Index.Analyzed). > I saw that Field.Index.Analyzed meant it would be tokenized. > > It is not working. =A0Searching on fubar or fubar.com does not return it. > Thanks for any help. > -- > View this message in context: http://old.nabble.com/Can%27t-get-tokenizat= ion-stop-works-working-tp27400546p27400546.html > Sent from the Lucene - Java Users mailing list archive at Nabble.com. > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org