Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 40044 invoked from network); 2 Dec 2003 13:55:08 -0000 Received: from daedalus.apache.org (HELO mail.apache.org) (208.185.179.12) by minotaur-2.apache.org with SMTP; 2 Dec 2003 13:55:08 -0000 Received: (qmail 40699 invoked by uid 500); 2 Dec 2003 13:54:59 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 40672 invoked by uid 500); 2 Dec 2003 13:54:58 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 40653 invoked from network); 2 Dec 2003 13:54:58 -0000 Received: from unknown (HELO web25204.mail.ukl.yahoo.com) (217.12.10.64) by daedalus.apache.org with SMTP; 2 Dec 2003 13:54:58 -0000 Message-ID: <20031202135458.73205.qmail@web25204.mail.ukl.yahoo.com> Received: from [212.126.153.189] by web25204.mail.ukl.yahoo.com via HTTP; Tue, 02 Dec 2003 13:54:58 GMT Date: Tue, 2 Dec 2003 13:54:58 +0000 (GMT) From: =?iso-8859-1?q?jt=20oob?= Subject: Ways to search indexes To: Lucene-Users-List MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N Hi, I have just indexed a lot of news (nntp) postings. I now have an index for each topic (a topic can have many newsgroups) The index sizes are: 2.6G Current Affairs 2.4G Celebs 119M Recreation 3.0M Tech - Mac 2.4G Tech - Windows 936M Tech - Linux 702M Tech - Other 96M Tech - Consoles This is still only early stages so i haven't yet done any parsing, just treating each doc as plain text. Originally I was merging all these indexes together, but this is now not feasible with new additions being made to each index as new postings arrive. I optimize each index at midnight. What is the best way to allow users to query either just one index, or the whole lot? My prototype was making a system call from and running my java program to print all the results to the screen. I know this isn't the best way to do it :-) I guess I need to write a server and periodically re-open the indexes to see any changes? Thank you for any help! jt ________________________________________________________________________ Download Yahoo! Messenger now for a chance to win Live At Knebworth DVDs http://www.yahoo.co.uk/robbiewilliams --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org