Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 87012 invoked from network); 5 Aug 2004 21:32:28 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 5 Aug 2004 21:32:28 -0000 Received: (qmail 73370 invoked by uid 500); 5 Aug 2004 21:32:21 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 73217 invoked by uid 500); 5 Aug 2004 21:32:20 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 73197 invoked by uid 99); 5 Aug 2004 21:32:19 -0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received: from [12.43.53.196] (HELO mail.sofari.com) (12.43.53.196) by apache.org (qpsmtpd/0.27.1) with ESMTP; Thu, 05 Aug 2004 14:32:16 -0700 Received: from [192.168.1.100] ([12.43.53.196]) by mail.sofari.com (8.12.11/8.12.3/Debian-6.6) with ESMTP id i75LWDSV004775 for ; Thu, 5 Aug 2004 14:32:14 -0700 Message-ID: <4112A758.6010700@newsmonster.org> Date: Thu, 05 Aug 2004 14:32:08 -0700 From: "Kevin A. Burton" User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7) Gecko/20040616 X-Accept-Language: en-us, en MIME-Version: 1.0 To: Lucene Users List Subject: Performance when computing computing a filter using hundreds of diff terms. Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N I'm trying to compute a filter to match documents in our index by a set of terms. For example some documents have a given field 'category' so I need to compute a filter with mulitple categories. The problem is that our category list is > 200 items so it takes about 80 seconds to compute. We cache it of course but this seems WAY too slow. Is there anything I could do to speed it up? Maybe run the queries myself and then combine the bitsets? We're using a BooleanQuery with nested TermQueries to build up the filter... Kevin -- Please reply using PGP. http://peerfear.org/pubkey.asc NewsMonster - http://www.newsmonster.org/ Kevin A. Burton, Location - San Francisco, CA, Cell - 415.595.9965 AIM/YIM - sfburtonator, Web - http://peerfear.org/ GPG fingerprint: 5FB2 F3E2 760E 70A8 6174 D393 E84D 8D04 99F1 4412 IRC - freenode.net #infoanarchy | #p2p-hackers | #newsmonster --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org