From: "Brandon Jockman"
To: "Lucene Users List"
Subject: Complex query puzzle.
Date: Wed, 14 Nov 2001 16:01:24 -0600

Lucene Gurus:

I need complex querying against my Lucene index that allows combinations of:

 . toggling case sensitivity on and off
 . toggling Porter stemming on and off
 . toggling fuzzy matching on and off
 . toggling wildcard matching on and off
 . more...

along with regular query strings that include optional date range queries.

The only way I see to do this is:

 . Creating multiple fields for each true field, one for each combination of the options (i.e., one for case sensitive & Porter stemming, one for case sensitive & NOT Porter stemming, etc.), then searching the particular field that fits the given search params.
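The field-per-combination scheme described above can be sketched roughly as follows. This is only an illustration of the naming and lookup logic, not Lucene API: the `FieldVariants` class, the `_cs`/`_ci`/`_stem`/`_raw` suffixes, and the `body` field name are all hypothetical. It also assumes that only case folding and stemming change the indexed terms, so fuzzy and wildcard matching are treated as query-time choices that need no extra fields.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper (not part of Lucene): maps index-time analysis
// options to the name of the field variant that was indexed with them.
final class FieldVariants {
    private FieldVariants() {}

    /** Builds the variant field name for one combination of options. */
    static String fieldName(String base, boolean caseSensitive, boolean stemmed) {
        return base + (caseSensitive ? "_cs" : "_ci") + (stemmed ? "_stem" : "_raw");
    }

    /** Lists every variant that would be written at index time for one true field. */
    static List<String> allVariants(String base) {
        List<String> names = new ArrayList<>();
        for (boolean caseSensitive : new boolean[] {true, false}) {
            for (boolean stemmed : new boolean[] {true, false}) {
                names.add(fieldName(base, caseSensitive, stemmed));
            }
        }
        return names;
    }

    public static void main(String[] args) {
        // Index time: each true field is written once per variant.
        System.out.println(allVariants("body"));
        // Search time: pick the single variant matching the user's toggles.
        System.out.println(fieldName("body", false, true));
    }
}
```

Under these assumptions, two binary index-time options give four stored variants per true field, and each further index-time toggle doubles that count, which is the duplication cost in question.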

There will be many thousands of documents in the index, and our logical documents are already being turned into multiple Lucene documents, so all of the duplication could be very costly.

Is there a better way of doing what I'm trying to do? Can you create an index with a custom analyzer that uses a combination of filters, and then search it using only a subset of those filters?

I could really use advice on this: alternatives, recommendations, etc.

Thanks in advance!

-Brandon Jockman