Return-Path: X-Original-To: apmail-lucene-dev-archive@www.apache.org Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C8A80EFBE for ; Wed, 13 Feb 2013 15:15:06 +0000 (UTC) Received: (qmail 12696 invoked by uid 500); 13 Feb 2013 15:15:05 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 12523 invoked by uid 500); 13 Feb 2013 15:15:05 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 12501 invoked by uid 99); 13 Feb 2013 15:15:04 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Feb 2013 15:15:04 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of yseeley@gmail.com designates 209.85.128.172 as permitted sender) Received: from [209.85.128.172] (HELO mail-ve0-f172.google.com) (209.85.128.172) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Feb 2013 15:15:00 +0000 Received: by mail-ve0-f172.google.com with SMTP id cz11so1174838veb.3 for ; Wed, 13 Feb 2013 07:14:39 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:x-received:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:content-type; bh=N9wSSsg/7/OTnaSGub5fL0ut1TLYzfLB1bUPYgNNIfk=; b=eWTbB0mGoOr8fqx/6/MBNurb0JmJZLjLVeOBfXkgZ2Syhux9NtpQBsiZrS5wrzJCWc ebTnqRlKtebsJE58sF/YGExCvQDwoTL3c0shvWu/l7Zd9bDdU+Tbm0bcGLU0Ssqvye8q RmMHQqhr91PWRuUjH0qmumiMBhiW9QvCJAP+p4F0KSYE4keIZlKsJmMJgTWHxFEo3wYP 6PneRMx71ABpTbl6FxNVCdj8UVe4188kXEPHB7Uui5gwbizr3AtzYrrqdmLGw2FxMPkF 6w/jQ+eRG7jB/eGxSgYDpFj55QJgVDgydynsjImkvcmpniCRh0JAw3ozKmoEM8Q5yHva 2rWQ== MIME-Version: 1.0 X-Received: by 10.58.247.132 with SMTP id ye4mr29690545vec.9.1360768479394; Wed, 13 Feb 2013 07:14:39 -0800 (PST) Sender: yseeley@gmail.com Received: by 10.59.8.138 with HTTP; Wed, 13 Feb 2013 07:14:39 -0800 (PST) In-Reply-To: References: <511A9EEF.2090204@elyograg.org> Date: Wed, 13 Feb 2013 10:14:39 -0500 X-Google-Sender-Auth: Dk2fNoMAz6OUCknuACiF4aU-Yqg Message-ID: Subject: Re: New Lucene features and Solr indexes From: Yonik Seeley To: dev@lucene.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org On Wed, Feb 13, 2013 at 8:01 AM, Robert Muir wrote: > On Wed, Feb 13, 2013 at 4:42 AM, Adrien Grand wrote: >> Hi Shawn, >> >> On Tue, Feb 12, 2013 at 8:58 PM, Shawn Heisey wrote: >>> Some of these, like compressed stored fields and compressed termvectors, are >>> being turned on by default, which is awesome. I'm already running a 4.2 >>> snapshot, so I've got those in place. >> >> Excellent! >> >>> One thing that I know I would like to do is use the new BloomFilter for a >>> couple of my fields that contain only unique values. Last time I checked >>> (which was before the 4.1 release), if you added the lucene-codecs jar, Solr >>> had a BloomFilter postings format, but didn't have any way to specify the >>> underlying format. See SOLR-3950 and LUCENE-4394. >> >> BloomFilterPostingsFormat is a little special compared to other >> postings formats because it can wrap any postings format. So maybe it >> should require special support, like an additional attribute in the >> field type definition? > > -1 > > Instead of making other APIs to accomodate BloomFilter's current > brokenness: remove its custom per-field logic so it works with > PerFieldPostingsFormat, like every other PF. > > In other words, it should work just like pulsing. > > I brought this up before it was committed, and i was ignored. Thats > fine, but I'll be damned if i let its incorrect design complicate > other parts of the codebase too. I'd rather it continue to stay > difficult to integrate and continue walking its current path to an > open source death instead. Would your desired changes in bloom postings format change the high level interface in Solr (i.e. specifying bloom on a field or fieldType in the schema)? If not, any currently needed work-around seems more like an implementation detail. -Yonik http://lucidworks.com --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org