Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 90928 invoked from network); 7 Apr 2008 21:17:38 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 7 Apr 2008 21:17:38 -0000 Received: (qmail 98977 invoked by uid 500); 7 Apr 2008 21:17:32 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 98686 invoked by uid 500); 7 Apr 2008 21:17:31 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 98675 invoked by uid 99); 7 Apr 2008 21:17:31 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Apr 2008 14:17:31 -0700 X-ASF-Spam-Status: No, hits=3.8 required=10.0 tests=MIME_QP_LONG_LINE,RCVD_IN_BL_SPAMCOP_NET,SPF_PASS,UNPARSEABLE_RELAY X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [62.219.78.209] (HELO smtpin.livedns.co.il) (62.219.78.209) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Apr 2008 21:16:40 +0000 Received: from ([127.0.0.1]) with MailEnable ESMTP; Tue, 08 Apr 2008 00:16:39 +0200 Received: from ([80.250.149.60]) by mail.livedns.co.il with MailEnable WebMail; Tue, 8 Apr 2008 00:13:09 +0300 To: From: "Itamar Syn-Hershko" Subject: Why Lucene has to rewrite queries prior to actual searching? Date: Tue, 8 Apr 2008 00:13:09 +0300 Message-ID: <4DD28DFBE83445D39F0177E2FC3C4C68.MAI@mail.livedns.co.il> MIME-Version: 1.0 X-Mailer: MailEnable WebMail.NET X-MimeOLE: Produced By MailEnable WebMail.NET V3.14.0.0 X-Read: 0 Content-Type: multipart/mixed; boundary="--=_Part_83B3577A0CAC48F4A77D3DFB7F7766CD" X-Priority: 3 X-MSMail-Priority: Medium X-Virus-Checked: Checked by ClamAV on apache.org ----=_Part_83B3577A0CAC48F4A77D3DFB7F7766CD Content-Type: text/plain; charset="windows-1255" Content-Transfer-Encoding: quoted-printable Hi all, Can someone from the experts here explain why Lucene has to get a "rewritten" query for the Searcher - so Phrase or Wildcards queries have to rewrite themselves into a "primitive" query, that is then passed to Lucene to look for=3F I'm probably not familiar too much with the internals of Lucene, but I'd imagine that if you can inflate a query using wildcards via xxxxQuery sub classing, you could as easily (=3F) have some sort of Filter mechanism during the search, so that Lucene retrieves the Position vectors for all the terms that pass that filter, instead of retrieving only the position data for deterministic terms (with no wildcards etc.). If that was possible to do somehow, it could greatly increase the searchability of Lucene indices by using RegEx (without re-writing and getting the dreaded MaxClauseCount error) and similar. Would love to hear some insights on this one. Itamar. ----=_Part_83B3577A0CAC48F4A77D3DFB7F7766CD Content-Type: text/plain; charset=us-ascii --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org ----=_Part_83B3577A0CAC48F4A77D3DFB7F7766CD--