Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7A90710F5F for ; Tue, 25 Mar 2014 08:26:00 +0000 (UTC) Received: (qmail 81230 invoked by uid 500); 25 Mar 2014 08:25:58 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 80865 invoked by uid 500); 25 Mar 2014 08:25:56 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 80852 invoked by uid 99); 25 Mar 2014 08:25:52 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 25 Mar 2014 08:25:52 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of erickerickson@gmail.com designates 209.85.128.179 as permitted sender) Received: from [209.85.128.179] (HELO mail-ve0-f179.google.com) (209.85.128.179) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 25 Mar 2014 08:25:47 +0000 Received: by mail-ve0-f179.google.com with SMTP id db12so127551veb.38 for ; Tue, 25 Mar 2014 01:25:27 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=LBNRJ7d6nxt0raFcwytHAhJPK18qDFEzKwrUC45qNrA=; b=aQReqWSZC3ihDQp/5oOQgLDhBasuQklantSQQlkJMiq4PCf58nsjIb53FTsjftZgyD IVPqi9nl+Gzkuz8IjCX9VcgBCIoiq9ahNJVlc48Ql+Gld3ZY3ses7lRXzoqShXCuOojp c3oYrGvcSMfrMzRV924NAJu707YS0zOQwmBIvwld5c90OyYERgxxqBMKGEM3W8M3g5qT Th+gD8mvYq9CV3dJY5Sxq5x5Lg/cAd0oK+mZCyBwn312VmCWEMclizt2QD3F41EuUDYt EWhGAH6YIpAyGl+JfHwSsYTS0XYrFJ6SA1qz1Noo9NLGzz5MTO2FFvq9kXprv24oY6cy GiUg== MIME-Version: 1.0 X-Received: by 10.221.26.10 with SMTP id rk10mr54820666vcb.0.1395735926949; Tue, 25 Mar 2014 01:25:26 -0700 (PDT) Received: by 10.52.69.234 with HTTP; Tue, 25 Mar 2014 01:25:26 -0700 (PDT) In-Reply-To: References: Date: Tue, 25 Mar 2014 04:25:26 -0400 Message-ID: Subject: Re: Index size for Same DataSet. From: Erick Erickson To: java-user Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org You're probably fine. Part of indexing is merging segments, and when segments are merged the data from deleted (or updated) documents is reclaimed. Any slight variance in the commit algorithm will potentially reclaim more or less space. What happens if you optimize (forceMerge) as a final step? That will reclaim all deleted space.... Best, Erick On Tue, Mar 25, 2014 at 1:36 AM, Jose Carlos Canova wrote: > Hello, > > I have a doubt about index size, > I am testing a program using Lucene to index some dataset. > > At the final the result of index size is varying a little, since i haven't > finished the tests at all, i'm doubt if it is normal the index size vary on > size among different tests. > > att. --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org