Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 40135 invoked from network); 10 Jun 2005 17:09:37 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 10 Jun 2005 17:09:37 -0000 Received: (qmail 75230 invoked by uid 500); 10 Jun 2005 17:09:29 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 75192 invoked by uid 500); 10 Jun 2005 17:09:28 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 75156 invoked by uid 99); 10 Jun 2005 17:09:28 -0000 X-ASF-Spam-Status: No, hits=0.4 required=10.0 tests=DNS_FROM_RFC_ABUSE,SPF_HELO_PASS X-Spam-Check-By: apache.org Received-SPF: neutral (hermes.apache.org: local policy) Received: from smtpauth09.mail.atl.earthlink.net (HELO smtpauth09.mail.atl.earthlink.net) (209.86.89.69) by apache.org (qpsmtpd/0.28) with ESMTP; Fri, 10 Jun 2005 10:09:27 -0700 Received: from [69.227.26.161] (helo=[69.227.26.161]) by smtpauth09.mail.atl.earthlink.net with asmtp (TLSv1:RC4-SHA:128) (Exim 4.34) id 1Dgn0R-0004ig-58 for java-user@lucene.apache.org; Fri, 10 Jun 2005 13:09:19 -0400 Mime-Version: 1.0 (Apple Message framework v730) In-Reply-To: <20050610163311.74855.qmail@web50607.mail.yahoo.com> References: <20050610163311.74855.qmail@web50607.mail.yahoo.com> Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed Message-Id: <62F51FD5-E137-49F2-AD08-AE5577B51972@corp.earthlink.net> Content-Transfer-Encoding: 7bit From: "Peter A. Friend" Subject: Re: Optimizing indexes with mulitiple processors? Date: Fri, 10 Jun 2005 10:09:16 -0700 To: java-user@lucene.apache.org X-Mailer: Apple Mail (2.730) X-ELNK-Trace: e38a400786bfeea01aa676d7e74259b7b3291a7d08dfec79d0fba56b4bb3a8194a0fd1ce717cc7ce350badd9bab72f9c350badd9bab72f9c350badd9bab72f9c X-Originating-IP: 69.227.26.161 X-Virus-Checked: Checked X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N On Jun 10, 2005, at 9:33 AM, Chris Collins wrote: > How many documents did you try to index? Only about 4000 at the moment. > I am using a relatively large > minMergeDoc that causes me to run out of memory when I make such a > change. (I > am using 1/2 gb of heap btw). I was running out of memory as well until I gave Java a larger heap to work with. I am assuming that a dedicated indexing machine (as well as search) is going to need a mountain of memory. I figure I will be giving Java gigs to play with. > I believe changing it in the outputstream object > means that a lot of in memory only objects use that size too. This I need to look into. At a guess, I would think that there would be an OutputStream object for each open segment, and each file in that segment. A consolidated index *might* use less but of course we are trying to improve performance here, and the consolidated index does incur a cost. Assuming 10 segments and 10 files within each segment, that's 100 OutputStream objects or 809,600 bytes. That'll grow quickly with merge tweaks. Those larger writes do save a bunch of system calls and make (maybe) better use of your filers block size. This grows quickly with maxMerge tweaks. Of course this could be utterly incorrect, I need to look into this a bit more carefully. > I dont know I would of used truss in this regard, this only points > out what > size hit the kernel not what went over the wire. I would suggest > using > ethereal to ensure thats how its ending up on the wire. True, hadn't gotten that far yet. :-) Peter --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org