Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 44393 invoked from network); 26 May 2004 21:41:53 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 26 May 2004 21:41:53 -0000 Received: (qmail 50897 invoked by uid 500); 26 May 2004 21:42:22 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 50825 invoked by uid 500); 26 May 2004 21:42:21 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 50805 invoked by uid 98); 26 May 2004 21:42:21 -0000 Received: from james_h_dunn@yahoo.com by hermes.apache.org by uid 82 with qmail-scanner-1.20 (clamuko: 0.70. Clear:RC:0(66.218.93.17):. Processed in 0.016841 secs); 26 May 2004 21:42:21 -0000 X-Qmail-Scanner-Mail-From: james_h_dunn@yahoo.com via hermes.apache.org X-Qmail-Scanner: 1.20 (Clear:RC:0(66.218.93.17):. Processed in 0.016841 secs) Received: from unknown (HELO web41101.mail.yahoo.com) (66.218.93.17) by hermes.apache.org with SMTP; 26 May 2004 21:42:21 -0000 Message-ID: <20040526214140.62527.qmail@web41101.mail.yahoo.com> Received: from [12.13.3.253] by web41101.mail.yahoo.com via HTTP; Wed, 26 May 2004 14:41:40 PDT Date: Wed, 26 May 2004 14:41:40 -0700 (PDT) From: James Dunn Subject: Re: Memory usage To: Lucene Users List In-Reply-To: <40B50C78.4020200@apache.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Spam-Rating: hermes.apache.org 1.6.2 0/1000/N X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N Doug, Thanks! I just asked a question regarding how to calculate the memory requirements for a search. Does this memory only get used only during the search operation itself, or is it referenced by the Hits object or anything else after the actual search completes? Thanks again, Jim --- Doug Cutting wrote: > James Dunn wrote: > > Also I search across about 50 fields but I don't > use > > wildcard or range queries. > > Lucene uses one byte of RAM per document per > searched field, to hold the > normalization values. So if you search a 10M > document collection with > 50 fields, then you'll end up using 500MB of RAM. > > If you're using unanalyzed fields, then an easy > workaround to reduce the > number of fields is to combine many in a single > field. So, instead of, > e.g., using an "f1" field with value "abc", and an > "f2" field with value > "efg", use a single field named "f" with values > "1_abc" and "2_efg". > > We could optimize this in Lucene. If no values of > an indexed field are > analyzed, then we could store no norms for the field > and hence read none > into memory. This wouldn't be too hard to > implement... > > Doug > > --------------------------------------------------------------------- > To unsubscribe, e-mail: > lucene-user-unsubscribe@jakarta.apache.org > For additional commands, e-mail: > lucene-user-help@jakarta.apache.org > __________________________________ Do you Yahoo!? Friends. Fun. Try the all-new Yahoo! Messenger. http://messenger.yahoo.com/ --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org