Message-ID: <4B749874.1020605@getopt.org>
Date: Fri, 12 Feb 2010 00:53:24 +0100
From: Andrzej Bialecki
To: solr-dev@lucene.apache.org
Subject: Re: Solr Performance and Scalability
In-Reply-To: <8f0ad1f31002111411i4adb771dn75f636c561f6dbb@mail.gmail.com>

On 2010-02-11 23:11, Robert Muir wrote:
> Tom, this is really completely unrelated, but given that you have such huge
> documents and I see you have exceeded term count limits in lucene, i can't
> help but wonder if you have ever considered Andrzej's index pruning patch?
> (it is simply a tool you can run on your index)
>
> depending upon requirements, seems like it might be a good fit.
>
> http://issues.apache.org/jira/browse/LUCENE-1812

Tom, if you decide to try this I'd be happy to help you with the tools
and the pruning strategies.

-- 
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com