Return-Path: Delivered-To: apmail-incubator-lucene-net-user-archive@locus.apache.org Received: (qmail 33540 invoked from network); 6 Nov 2006 15:21:04 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 6 Nov 2006 15:21:04 -0000 Received: (qmail 82571 invoked by uid 500); 6 Nov 2006 15:21:16 -0000 Delivered-To: apmail-incubator-lucene-net-user-archive@incubator.apache.org Received: (qmail 82378 invoked by uid 500); 6 Nov 2006 15:21:15 -0000 Mailing-List: contact lucene-net-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: lucene-net-user@incubator.apache.org Delivered-To: mailing list lucene-net-user@incubator.apache.org Received: (qmail 82360 invoked by uid 99); 6 Nov 2006 15:21:14 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 06 Nov 2006 07:21:14 -0800 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=RCVD_IN_SORBS_WEB X-Spam-Check-By: apache.org Received-SPF: neutral (herse.apache.org: local policy) Received: from [216.222.193.2] (HELO sidekick.frogspace.net) (216.222.193.2) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 06 Nov 2006 07:20:59 -0800 Received: from bi01p1.nc.us.ibm.com ([129.33.49.251] helo=aroushlt) by sidekick.frogspace.net with esmtp (Exim 4.44) id 1Gh6H8-0006hC-He; Mon, 06 Nov 2006 07:20:38 -0800 From: "George Aroush" To: , Subject: RE: Need some help understanding what the "StandardAnalyzer" is doing here ... Date: Mon, 6 Nov 2006 10:20:36 -0500 Message-ID: <057001c701b7$1cca9570$d6434c09@aroushlt> MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit X-Mailer: Microsoft Office Outlook 11 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.2962 Thread-Index: AccBqOC9bnIAd8HbSLWH4ymFoe2ShAADQqZQ In-Reply-To: X-SA-Exim-Connect-IP: 129.33.49.251 X-SA-Exim-Mail-From: george@aroush.net X-Virus-Checked: Checked by ClamAV on apache.org Hi Andy, What's happening here is the data getting analyzed and tokenized. You need to stop that from happening on this field. There are two solution that come to mind which Lucene.Net offers, use per-field analyzer and tokenizer, or (this is easier) store the field as non-tokenized. Regards, -- George Aroush -----Original Message----- From: Andy Berryman [mailto:topdev1@gmail.com] Sent: Monday, November 06, 2006 8:38 AM To: lucene-net-user@incubator.apache.org; lucene-net-dev@incubator.apache.org Subject: Need some help understanding what the "StandardAnalyzer" is doing here ... I have an index with a Field named "SKU" which is a "Text" type. I'm using the "StandardAnalyzer" for indexing and searching. I'm using "Luke" ( http://www.getopt.org/luke/luke.jnlp) to do some testing for this problem and to allow me to see how Lucene is parsing the query etc. If I provide the search expression as ... *SKU:andyb-test-item-001* ... Lucene is parsing that to ... *SKU:"andyb test item-001"*. Soo my question is ... Why are the dashes between "andyb", "test", and "item" being removed but not the one between "item" and "001"? Thanks Andy