Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 93353 invoked from network); 30 Sep 2009 03:55:18 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 30 Sep 2009 03:55:18 -0000 Received: (qmail 34107 invoked by uid 500); 30 Sep 2009 03:55:15 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 34021 invoked by uid 500); 30 Sep 2009 03:55:13 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 34011 invoked by uid 99); 30 Sep 2009 03:55:13 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 30 Sep 2009 03:55:13 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of ihasmax@gmail.com designates 209.85.218.214 as permitted sender) Received: from [209.85.218.214] (HELO mail-bw0-f214.google.com) (209.85.218.214) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 30 Sep 2009 03:55:04 +0000 Received: by bwz10 with SMTP id 10so4964688bwz.5 for ; Tue, 29 Sep 2009 20:54:44 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:date:message-id:subject :from:to:content-type; bh=2/zlJDLf/bda24v9GtgWAuizRQHHlqzfSUV2TbJKauA=; b=Mykec2RtZshM0pKYc2XZpJQ2s7WMeBmWgXibVSKFYZ4I2BMW/NWObilH9928zcsYUD z/WuDLppAKBfvV/PAMH9wRsVeTqNX6B93FBesGyRv7sNQ0PTIy9IBkTarjgC+sTnvWBU 9Wdiz8j6EYJT4Pr6tnV/bGqE4+77R0JlJ706s= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=LjVenWLvIDYWi1ipZdcqbUW0zp38a3y0vSGLkMe7EZ+aDAuQTsbrUWdA5g/jsB25UZ X3UTNgi+m9UCvyZXkx/eWpu89Ksw5rZlMC3MBVpqWBG+m2A3GSpHq9ycj6b5bOcbEi+o Xgs8JuPDs3QGMpEPny8NYf7cys5oePfvG0dhg= MIME-Version: 1.0 Received: by 10.103.126.33 with SMTP id d33mr2102985mun.109.1254282883920; Tue, 29 Sep 2009 20:54:43 -0700 (PDT) Date: Tue, 29 Sep 2009 22:54:43 -0500 Message-ID: <3836ec640909292054h4e92715bq1ae1b373f79ebf30@mail.gmail.com> Subject: Whitespace/Standard Analyzer and punctuation From: Max Lynch To: java-user@lucene.apache.org Content-Type: multipart/alternative; boundary=0016e65b3ffcbe734b0474c37d7f X-Virus-Checked: Checked by ClamAV on apache.org --0016e65b3ffcbe734b0474c37d7f Content-Type: text/plain; charset=ISO-8859-1 I would like my searches to match "John Smith" when John Smith is in a document, but not separated with punctuation. For example, when I was using StandardAnalyzer, "John. Smith" was matching, which is wrong for me. Right now I am using WhitespaceAnalyzer but instead searching for "John Smith" "John Smith." "John Smith," etc., which seems like a dumb thing to be doing. Can I separate the punctuation but keep the analyzer aware of where the punctuation occurred in my matching term? Thanks. --0016e65b3ffcbe734b0474c37d7f--