Return-Path: Delivered-To: apmail-lucene-java-dev-archive@www.apache.org Received: (qmail 326 invoked from network); 11 Aug 2009 18:02:40 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 11 Aug 2009 18:02:40 -0000 Received: (qmail 47054 invoked by uid 500); 11 Aug 2009 18:01:34 -0000 Delivered-To: apmail-lucene-java-dev-archive@lucene.apache.org Received: (qmail 46460 invoked by uid 500); 11 Aug 2009 18:01:31 -0000 Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-dev@lucene.apache.org Delivered-To: mailing list java-dev@lucene.apache.org Received: (qmail 45150 invoked by uid 99); 11 Aug 2009 17:54:53 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Aug 2009 17:54:53 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of markrmiller@gmail.com designates 209.85.217.218 as permitted sender) Received: from [209.85.217.218] (HELO mail-gx0-f218.google.com) (209.85.217.218) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Aug 2009 17:54:43 +0000 Received: by gxk18 with SMTP id 18so4748339gxk.5 for ; Tue, 11 Aug 2009 10:54:22 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from :user-agent:mime-version:to:subject:content-type :content-transfer-encoding; bh=esinahYqpNNhvgAZLl1KyfoM6tjc8bfODwmc6Rf/NhI=; b=SGWanZ+0jlielOSYvf75nNjoXOjj1iO7sm5z1gm0CctZZ+oRc2MZlQ+mqqZvjFrFha yXEsAV94HhDFvOU1f+llfhIVGJ8mY1HMVKtf/FDua+NkZWaXT3/tOfm4lj6TO1h/tdOR UbEPiIgythr6LKS0oWX3HsXrbWHBBniQKB0BA= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:user-agent:mime-version:to:subject :content-type:content-transfer-encoding; b=vvlBVpM7Fga1VuYu4oHlrWC4F/ILZAXJRgW4Ok4jA5hRsA8ebvlu55OQXMQLVy605G izbI80KHNVHaXNlOAWyc6SrKknrTB+ejJfnRvLOqAt9j5717bCZ1tWK+PriWijVnSLjG WszOVXxqjqNmnbmDrs7d0gkBQZBJOASbZplD4= Received: by 10.90.82.17 with SMTP id f17mr5147219agb.114.1250013262756; Tue, 11 Aug 2009 10:54:22 -0700 (PDT) Received: from ?192.168.1.102? (ool-44c639d9.dyn.optonline.net [68.198.57.217]) by mx.google.com with ESMTPS id 27sm283125agb.2.2009.08.11.10.54.21 (version=SSLv3 cipher=RC4-MD5); Tue, 11 Aug 2009 10:54:22 -0700 (PDT) Message-ID: <4A81B04D.3010808@gmail.com> Date: Tue, 11 Aug 2009 13:54:21 -0400 From: Mark Miller User-Agent: Thunderbird 2.0.0.22 (X11/20090608) MIME-Version: 1.0 To: java-dev@lucene.apache.org Subject: The new Contrib QueryParser should not be slated to replace the old one yet Content-Type: text/plain; charset=windows-1252; format=flowed Content-Transfer-Encoding: 8bit X-Virus-Checked: Checked by ClamAV on apache.org I don't think we should stick with the current path of replacing the current QueryParser with the new contrib QueryParser in Lucene 3.0. The new QueryParser has not been used much at all yet. Its interfaces (which will need to abide by back compat in core) have not been vetted enough. The new parser appears to add complication to some of things that were very simple with the old parser. The main benefits of the new parser are claimed to be the ability to plug and play many syntaxes and QueryBuilders. This is not an end user benefit though and I'm not even sure how much of a benefit it is to us. There is currently only one impl. It seems to me, once you start another impl, its a long shot that the exact same query tree representation is going to work with a completely different syntax. Sure, if you are just doing postfix rather than prefix, it will be fine � but the stuff that would likely be done � actual new syntaxes � are not likely to be very pluggable. If a syntax can map to the same query tree, I think we would likely stick to a single syntax � else suffer the confusion and maintenance headaches for syntactic sugar. More than a well factored QueryParser that can more easily allow different syntaxes to map to the same query tree representation, I think we just want a single solid syntax for core Lucene that supports Spans to some degree. We basically have that now, sans the spans support. Other, more exotic QueryParsers should live in contrib, as they do now. Which isn't to say this QueryParser should not one day rule the roost � but I don't think its earned the right yet. And I don't think there is a hurry to toss the old parser. Personally, I think that the old parser should not be deprecated. Lets let the new parser breath in contrib for a bit. Lets see if anyone actually adds any other syntaxes. Lets see if the pluggability results in any improvements. Lets see if some of the harder things to do (overriding query build methods?) become easier or keep people from using the new parser. Lets just see if the new parser draws users without us forcing them to it. And lets also wait and see what other committers say � not many have gotten much time to deal with the new parser, or deal with user list questions on it. I just think its premature to start moving people to this new parser. It didn't even really get in until right before release � the paint on the thing still reeks. There is no rush. I saw we undeprecate the current QueryParser and remove the wording in the new QueryParser about it replacing the new in 3.0. Later, if we think it should replace it (after having some experience to judge from), we can reinstate the current plan. Anyone agree? -- - Mark http://www.lucidimagination.com --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org For additional commands, e-mail: java-dev-help@lucene.apache.org