Subject: Edit distance and wildcard searching with PhraseQuery
Date: Wed, 11 Nov 2009 17:48:45 -0500
From: "Jeff Plater"
Reply-To: java-user@lucene.apache.org

Hi,

I am trying to figure out a way that I can query a Lucene index for a phrase but have some fuzziness (edit distance and/or wildcard) applied to the individual terms. An example should help explain what I am trying to do:

Index contains: Philadelphia PA

Search is done on: Philadelphid PA

I want it to result in a hit - basically something like "Philadelphid~0.75 PA" (that syntax is not valid but explains what I am looking for). Similarly, I would like to be able to do something like "Phil* PA" and get a hit as well.

Does anyone know how I can accomplish this? Right now I am having to hit a look up table to translate the city before searching against the main index - not a fan of this option.

Thanks.

-Jeff Plater
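
To make the requested behaviour concrete, here is a minimal sketch in plain Java of the matching semantics described above. It does not use any Lucene API and is not how Lucene implements anything: the class `FuzzyPhraseSketch`, its method names, the per-term `term~0.75` and `prefix*` syntax, and the similarity normalisation (1 minus edit distance divided by the longer term's length) are all assumptions made for illustration.

```java
import java.util.Locale;

// Hypothetical illustration only: models "phrase match with fuzzy or
// prefix-wildcard terms" on plain token arrays, without any Lucene classes.
final class FuzzyPhraseSketch {

    /** Levenshtein edit distance, computed with two rolling rows. */
    static int editDistance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    /** Assumed normalisation: 1 - distance / length of the longer string. */
    static double similarity(String a, String b) {
        int longer = Math.max(a.length(), b.length());
        return longer == 0 ? 1.0 : 1.0 - (double) editDistance(a, b) / longer;
    }

    /** One pattern term against one indexed token (case-insensitive). */
    static boolean termMatches(String pattern, String token) {
        String p = pattern.toLowerCase(Locale.ROOT);
        String t = token.toLowerCase(Locale.ROOT);
        if (p.endsWith("*")) {
            // Trailing wildcard: prefix match, e.g. "Phil*".
            return t.startsWith(p.substring(0, p.length() - 1));
        }
        int tilde = p.lastIndexOf('~');
        if (tilde >= 0) {
            // Fuzzy term: "word~minSimilarity", e.g. "Philadelphid~0.75".
            double minSimilarity = Double.parseDouble(p.substring(tilde + 1));
            return similarity(p.substring(0, tilde), t) >= minSimilarity;
        }
        return p.equals(t);
    }

    /** True if the patterns match some run of consecutive tokens. */
    static boolean phraseMatches(String[] patterns, String[] tokens) {
        for (int start = 0; start + patterns.length <= tokens.length; start++) {
            boolean all = true;
            for (int i = 0; i < patterns.length && all; i++) {
                all = termMatches(patterns[i], tokens[start + i]);
            }
            if (all) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        String[] indexed = {"Philadelphia", "PA"};
        System.out.println(phraseMatches(new String[] {"Philadelphid~0.75", "PA"}, indexed));
        System.out.println(phraseMatches(new String[] {"Phil*", "PA"}, indexed));
        System.out.println(phraseMatches(new String[] {"Philadelphid", "PA"}, indexed));
    }
}
```

With the example from the message, the first two calls in `main` report a hit and the third (the misspelled term with no fuzziness) does not. "Philadelphid" and "Philadelphia" differ by one substitution over twelve characters, so under the assumed normalisation the similarity is about 0.92, above the 0.75 threshold.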