Date: Mon, 29 Jun 2009 14:41:26 -0400
Subject: Re: Lucene Term Encoder
From: Erick Erickson
To: java-user@lucene.apache.org

You probably need to make sure you understand analyzers before
you think about escaping/encoding. For instance, if you use
StandardAnalyzer when indexing, the text "Las Vegas-Food Dining Place"
would be indexed as the tokens

las
vegas
food
dining
place

with nary a hyphen to be seen. If you also used StandardAnalyzer for
querying, all would be fine, because the query text is broken into
tokens the same way.

I strongly recommend you get a copy of Luke to examine the results of
using different analyzers, because different analyzers do different things.

A good starting point is to not worry about this at all and
use the *same* analyzer for indexing and searching, then modify
as necessary.

Best
Erick

On Mon, Jun 29, 2009 at 1:44 PM, John Seer wrote:

>
> Hello Simon,
>
> I am looking for some class which will automatically take care of text and
> convert it into text which can be used in a query, the same way as URLEncoder
> encodes a string for a URL. For example:
>
> Term: Las Vegas-Food AND Dining place
>
> After encoding term: Las Vegas(escapedDash)Food and Dining place.
>
> Basically my main problem is the dash. For the moment I don't know how to
> search for a term which has a dash in it.
>
> Thanks
>
>
> Simon Willnauer wrote:
> >
> > Hi John,
> >
> > what do you mean by encoding? If you can be more clear about what you
> > are looking for you might get help easily.
> >
> > simon
> >
> > On Sat, Jun 27, 2009 at 12:27 AM, John Seer wrote:
> >>
> >> Hello,
> >> Is there any class in lucene which will do encoding for term?
> >>
> >>
> >> Thanks
> >> --
> >> View this message in context:
> >> http://www.nabble.com/Lucene-Term-Encoder-tp24228145p24228145.html
> >> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
> >>
> >> ---------------------------------------------------------------------
> >> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> >> For additional commands, e-mail: java-user-help@lucene.apache.org
> >>
> >
>
> --
> View this message in context:
> http://www.nabble.com/Lucene-Term-Encoder-tp24228145p24258634.html
> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
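
To make the analyzer point in the reply concrete, here is a minimal
plain-Java sketch. It is not Lucene code: the class AnalyzerSketch and the
method roughTokens are hypothetical names, and the rule (lowercase, then
split on anything that is not a letter or digit) is an assumed, much cruder
stand-in for what StandardAnalyzer really does. It only shows why the dash
stops mattering once the same rule is applied at index time and at query
time. Luke remains the way to see what a real analyzer produces.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical stand-in for an analyzer; NOT part of Lucene.
class AnalyzerSketch {

    // Lowercase the text, then split on every run of characters
    // that are neither letters nor digits (so '-' and ' ' both split).
    static List<String> roughTokens(String text) {
        List<String> tokens = new ArrayList<String>();
        String lowered = text.toLowerCase(Locale.ROOT);
        for (String piece : lowered.split("[^\\p{L}\\p{N}]+")) {
            if (!piece.isEmpty()) {   // split() can yield a leading empty string
                tokens.add(piece);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // "Index time": the hyphen never makes it into the token list.
        System.out.println(roughTokens("Las Vegas-Food Dining Place"));
        // prints [las, vegas, food, dining, place]

        // "Query time": the same rule gives tokens that match the ones above,
        // so there is no dash left to escape.
        System.out.println(roughTokens("Vegas-Food"));
        // prints [vegas, food]
    }
}
```

If the two sides used different rules (say, one that kept "vegas-food" as a
single token), the token lists would no longer line up, which is the
situation the "same analyzer" advice is meant to avoid.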