Message-ID: <1f8734680808160009p3cf159ob012037f35c1be89@mail.gmail.com>
Date: Sat, 16 Aug 2008 09:09:59 +0200
From: "Michael Zehrer"
To: java-user@lucene.apache.org
Subject: Re: Efficient delete
In-Reply-To: <36F19444-085E-4001-9C6A-D8C06302B062@mikemccandless.com>
References:
 <1f8734680808070546q6d36cbe8m6687b70f7782d6df@mail.gmail.com>
 <8c4e68610808080223p343f390i1fc3d2a5e42b6a7e@mail.gmail.com>
 <36F19444-085E-4001-9C6A-D8C06302B062@mikemccandless.com>

Mike,

what difference would it make if I used IndexWriter?

Michael

On Fri, Aug 8, 2008 at 12:27, Michael McCandless wrote:
>
> Also, can you do your deletes via IndexWriter (delete by Term) instead of
> opening IndexReader to do the deletes?
>
> Mike
>
> Ian Lea wrote:
>
>> Michael
>>
>> Did you get anywhere with this? 3 secs for one delete is excessive. A
>> job of mine ran earlier today and did 2000+ deletes by term on unique
>> id in less than 9 seconds. The index is smaller, at around 5 GB, but I
>> don't believe that would explain the difference. All the deletes were
>> done in one batch using Lucene 2.3.2.
>>
>> Is there a lot of other concurrent activity on the index? On the
>> server? Is it CPU or IO bound?
>>
>> --
>> Ian.
>>
>> On Thu, Aug 7, 2008 at 1:46 PM, Michael Zehrer wrote:
>>>
>>> Dear List,
>>>
>>> I have a rather big index, around 20 GB. My documents have a unique id
>>> that I store in an untokenized field.
>>>
>>> Using an IndexReader, I delete documents by term using the id. The
>>> application tries to batch as many delete operations as possible for
>>> this.
>>>
>>> The application runs on an 8-core machine with a 1 GB heap for the
>>> JVM. Each delete takes about 3 sec to complete. I have a lot of updates
>>> during a day, so this takes a lot of time and in effect this is too
>>> slow.
>>>
>>> Is there anything I can do to speed things up a little?
>>>
>>> Best,
>>>
>>> Michael
>>>
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>>> For additional commands, e-mail: java-user-help@lucene.apache.org
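
The thread turns on batching deletes by unique id, as in Ian's report of 2000+ deletes done in one batch. A minimal sketch of that buffering pattern follows, in plain Java with no Lucene dependency. `BatchDeleter`, `DeleteBuffer`, and the `maxPending` threshold are hypothetical names introduced here for illustration; they come neither from the thread nor from Lucene. In a real application the `deleteByIds` callback would presumably wrap a delete-by-term call such as `IndexWriter.deleteDocuments(Term[])`, but that wiring is an assumption and is not shown.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical stand-in for whatever applies the deletes to the index. */
interface BatchDeleter {
    /** Deletes all documents whose id matches one of the given ids. */
    void deleteByIds(List<String> ids);
}

/** Buffers delete requests and hands them over in batches. Not thread-safe. */
class DeleteBuffer {
    // LinkedHashSet collapses duplicate ids and keeps request order.
    private final Set<String> pending = new LinkedHashSet<String>();
    private final BatchDeleter deleter;
    private final int maxPending;

    DeleteBuffer(BatchDeleter deleter, int maxPending) {
        this.deleter = deleter;
        this.maxPending = maxPending;
    }

    /** Queues an id; flushes once maxPending distinct ids are queued. */
    void delete(String id) {
        pending.add(id);
        if (pending.size() >= maxPending) {
            flush();
        }
    }

    /** Sends all queued ids in a single call and clears the buffer. */
    void flush() {
        if (pending.isEmpty()) {
            return;
        }
        deleter.deleteByIds(new ArrayList<String>(pending));
        pending.clear();
    }

    int pendingCount() {
        return pending.size();
    }
}

class BatchedDeleteSketch {
    public static void main(String[] args) {
        final List<Integer> batchSizes = new ArrayList<Integer>();
        // The callback only records batch sizes; a real one would touch the index.
        DeleteBuffer buffer = new DeleteBuffer(new BatchDeleter() {
            public void deleteByIds(List<String> ids) {
                batchSizes.add(ids.size());
            }
        }, 1000);
        for (int i = 0; i < 2500; i++) {
            buffer.delete("doc-" + i);
        }
        buffer.flush(); // push out the remainder
        System.out.println(batchSizes); // prints [1000, 1000, 500]
    }
}
```

The point of the buffer is that the expensive step (opening or flushing whatever performs the deletes) happens once per batch rather than once per id; whether that accounts for the 3 seconds per delete reported above is not established in the thread.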