Message-Id: <762DAB9B-4B30-4B6C-B9D4-1E0101371F47@apache.org>
From: Grant Ingersoll
To: java-dev@lucene.apache.org
In-Reply-To: <4A81C49E.4000407@gmail.com>
Subject: Re: who clears attributes?
Date: Tue, 11 Aug 2009 16:56:50 -0400
References: <23A9FD4876BD40B3BE312126E12B3E18@VEGA>
 <59b3eb370908101100xda2d85ag222599ad11a1f1ba@mail.gmail.com>
 <345ECC25630C46B8B55AED45F8B2C691@VEGA>
 <4266B483-E969-44B0-90EC-810132D5A4B2@apache.org>
 <786fde50908101412y4f52ffdcp71e7482c10550082@mail.gmail.com>
 <4A809F06.3000508@gmail.com>
 <4A80CE34.8030208@gmail.com>
 <4A812BC7.7000408@gmail.com>
 <9FAFA8FA-D5BE-45F1-AD53-9BD6C2FEE5C4@apache.org>
 <4A81C49E.4000407@gmail.com>

On Aug 11, 2009, at 3:21 PM, Michael Busch wrote:

> On 8/11/09 4:13 AM, Grant Ingersoll wrote:
>>
>> On Aug 11, 2009, at 4:28 AM, Michael Busch wrote:
>>
>>>
>>>> I'm not just responding to just you there, but more to the
>>>> growing pack of those speaking against the new API. I don't see
>>>> specific issues being brought up - the only issues I have seen
>>>> brought up have been addressed in JIRA issues that have received
>>>> no comments indicating the fix was not good enough. So we are
>>>> seeing a lot of general complaints, but specific complaints have
>>>> been addressed as far as I can tell.
>>>>
>>> Thanks Mark. Yeah, I'm really not sure what actually the problem
>>> here is now. There was a performance test in Solr that apparently
>>> ran much slower after upgrading to the new Lucene jar. This test
>>> is testing a rather uncommon scenario: very very short documents.
>>
>> That is not an uncommon scenario. Solr has very, very short fields
>> _ALL THE TIME_.
>>
>
> I meant that having documents that only contain very short fields is
> not as common as having docs with a decent amount of text. Maybe
> I'm wrong - in either case I didn't try to say it's not an important
> use case. I think it is important to have good performance here
> too. The point I was trying to make was that we tested performance
> more thoroughly for the case we thought would be more common.

FWIW, I think the most common scenario is one or two large fields plus
several small fields (usually 5-10, though I have seen cases with many
more). At least that has been my experience. Some of the small fields
require analysis, some don't.

>
> According to the numbers posted on LUCENE-1796 it now seems like
> it's fixed - even for documents with only very short fields and no
> reusable TokenStreams.
>

Very cool.