Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 24732 invoked from network); 1 Dec 2006 01:30:08 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 1 Dec 2006 01:30:08 -0000 Received: (qmail 349 invoked by uid 500); 1 Dec 2006 01:30:11 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 311 invoked by uid 500); 1 Dec 2006 01:30:11 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 300 invoked by uid 99); 1 Dec 2006 01:30:11 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 30 Nov 2006 17:30:11 -0800 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (herse.apache.org: domain of vnguyen@ur.com designates 63.241.148.49 as permitted sender) Received: from [63.241.148.49] (HELO iris.ur.com) (63.241.148.49) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 30 Nov 2006 17:29:59 -0800 Received: from unknown (HELO UREXCHSRV5.ur.com) ([10.6.134.35]) by iris.ur.com with ESMTP; 01 Dec 2006 01:30:42 +0000 X-OriginatingIP: 10.6.134.35 X-InternalServer: 10.6.134.35 Received: from UREXCHVS3.ur.com ([10.6.138.34]) by UREXCHSRV5.ur.com with Microsoft SMTPSVC(6.0.3790.1830); Thu, 30 Nov 2006 20:28:06 -0500 X-MimeOLE: Produced By Microsoft Exchange V6.5 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----_=_NextPart_001_01C714E8.2AD644BC" Subject: any ides on this type of analyzer? Date: Thu, 30 Nov 2006 20:25:18 -0500 Message-ID: <0D6A3C278F4DC346B98DF4D2F1397E811742E620@UREXCHVS3.ur.com> X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: any ides on this type of analyzer? Thread-Index: AccU549+QEOVoPWWToegfMa6B7JUew== From: "Van Nguyen" To: X-OriginalArrivalTime: 01 Dec 2006 01:28:06.0323 (UTC) FILETIME=[F3C6A030:01C714E7] X-Virus-Checked: Checked by ClamAV on apache.org ------_=_NextPart_001_01C714E8.2AD644BC Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable I've been trying to brainstorm on this but could not figure out a way to go about this. =20 Let's say I'm searching for "batman". I want results that include: =20 batman bat man bat-man etc. =20 or if I search screwdriver, I would want results to include: =20 screwdriver screw drivers etc. =20 I've tried using the SnowballAnalyzer. I've thought about creating a "SynonymAnalyzer" as described in the Lucene In Action book (but that would mean I would have to know all the synonyms for each word I need to index - at this point I do not). Any suggestions on how to go about this? =20 Van ------_=_NextPart_001_01C714E8.2AD644BC--