Date: Thu, 30 Apr 2009 09:46:21 -0400
Subject: Re: Indexing becomes slow with time
From: Erick Erickson
To: java-user@lucene.apache.org

This is surprising behavior, which is another way of saying that, given
what you've said so far, this shouldn't be happening. I'd really look at
system metrics, like whether you're swapping etc. In particular, you might
want to try varying how big you allow your memory footprint to grow before
you flush; this is in the doc Ian pointed out, under
*Flush by RAM usage instead of document count*.

There's no need to periodically optimize; just do that at the end if you
must.

Best
Erick

On Thu, Apr 30, 2009 at 6:23 AM, liat oren wrote:

> Yes, I do run optimize...
>
> I did start looking at these tips in the last few days, but didn't think
> the optimize makes it so slow.
>
> Thanks!
>
> 2009/4/30 Ian Lea
>
> > Are you maybe running optimize after every n documents? There are
> > lots of tips in
> > http://wiki.apache.org/lucene-java/ImproveIndexingSpeed.
> >
> > --
> > Ian.
> >
> > On Thu, Apr 30, 2009 at 8:29 AM, liat oren wrote:
> > > Hi,
> > >
> > > I noticed that when I start to index, it indexes 7 documents a second.
> > > After 30 minutes it goes down to 3 documents a second.
> > > After two hours it becomes very slow (I stopped it when it reached
> > > 320MB and did 1 document in almost a minute)
> > >
> > > As you can see, it happens only after 2000, 3000 documents.
> > > Should I split them into more indexes?
> > >
> > > Thanks,
> > > Liat
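
A rough way to see why optimizing after every n documents can produce the kind of slowdown described in this thread: an optimize merges the whole index into one segment, so each call rewrites everything indexed so far. The sketch below is a toy cost model only, not Lucene code. The class name, method names, and the "work unit" are all hypothetical, and it assumes that adding a document costs 1 unit and that one optimize costs 1 unit per document already in the index. It ignores the ordinary background segment merges that happen in either case.

```java
public class OptimizeCostModel {

    /**
     * Total work units to index totalDocs documents when a full-index
     * rewrite (the assumed cost of an optimize) runs every optimizeEvery
     * documents. Assumption: the rewrite costs 1 unit per document
     * indexed so far.
     */
    public static long periodicOptimizeCost(int totalDocs, int optimizeEvery) {
        long cost = 0;
        for (int indexed = 1; indexed <= totalDocs; indexed++) {
            cost += 1; // adding one document
            if (indexed % optimizeEvery == 0) {
                cost += indexed; // rewrite everything indexed so far
            }
        }
        return cost;
    }

    /** Total work units when the only rewrite happens once, at the end. */
    public static long optimizeAtEndCost(int totalDocs) {
        return (long) totalDocs + totalDocs; // N adds plus one rewrite of N docs
    }

    public static void main(String[] args) {
        int totalDocs = 3000; // roughly the document count mentioned in the thread
        System.out.println("optimize every 100 docs: "
                + periodicOptimizeCost(totalDocs, 100) + " units");
        System.out.println("optimize once at end:    "
                + optimizeAtEndCost(totalDocs) + " units");
    }
}
```

Under these assumptions the periodic variant grows quadratically with the number of documents, while optimizing once at the end stays linear, which is consistent with throughput that keeps dropping as the index grows.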