Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 822FE200C2C for ; Fri, 3 Mar 2017 19:37:52 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 80D3F160B6D; Fri, 3 Mar 2017 18:37:52 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id CB194160B5E for ; Fri, 3 Mar 2017 19:37:51 +0100 (CET) Received: (qmail 32907 invoked by uid 500); 3 Mar 2017 18:37:50 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 32319 invoked by uid 99); 3 Mar 2017 18:37:50 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 03 Mar 2017 18:37:50 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id B479A1A0168 for ; Fri, 3 Mar 2017 18:37:49 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.702 X-Spam-Level: X-Spam-Status: No, score=-0.702 tagged_above=-999 required=6.31 tests=[RCVD_IN_DNSWL_LOW=-0.7, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id lq9ltdH-cwOk for ; Fri, 3 Mar 2017 18:37:48 +0000 (UTC) Received: from mail.sd-datasolutions.de (serv2.sd-datasolutions.de [85.25.204.22]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id 1E82C5FAD8 for ; Fri, 3 Mar 2017 18:37:48 +0000 (UTC) Received: from VEGA (p200300C103DA700159E7A9E7D5EC356F.dip0.t-ipconnect.de [IPv6:2003:c1:3da:7001:59e7:a9e7:d5ec:356f]) by mail.sd-datasolutions.de (Postfix) with ESMTPSA id 7A1CD16F802A3 for ; Fri, 3 Mar 2017 18:37:44 +0000 (UTC) X-NSA-Greeting: Dear NSA, have fun with reading and analyzing this e-mail! From: "Uwe Schindler" To: References: <393CE17D-24A9-4E99-A214-BBA004F85B24@bammers.net> In-Reply-To: <393CE17D-24A9-4E99-A214-BBA004F85B24@bammers.net> Subject: RE: Index size variation Date: Fri, 3 Mar 2017 19:37:42 +0100 Message-ID: <000801d2944d$3ed75de0$bc8619a0$@thetaphi.de> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable X-Mailer: Microsoft Outlook 16.0 Thread-Index: AQHuq5X9p9YmjRtHXafGK+tbYyhDjaFLQfFA Content-Language: de archived-at: Fri, 03 Mar 2017 18:37:52 -0000 Hi Chris, as always: "it depends". Generally I would reserve space of = approximately the "original" index size. Most indexes that are = continuously updated have an overhead by 40% for deleted documents. The = remaining 60% are indeed used for merging. Please also keep in mind that = if IndexReaders are open while merging, you may see not all used disk = space from the directory listing, because some files that were merged = but are still open in IndexReaders already disappeared from directory = listing (deleted during merge), but still consume disk space (delete on = last close POSIX semantics). If you also "forceMerge" your index (you should never ever do this), you = may temporarily need 3 times the index size. Uwe ----- Uwe Schindler Achterdiek 19, D-28357 Bremen http://www.thetaphi.de eMail: uwe@thetaphi.de > -----Original Message----- > From: Chris Bamford [mailto:chris@bammers.net] > Sent: Friday, March 3, 2017 7:24 PM > To: java-user@lucene.apache.org > Subject: Index size variation >=20 > Hello >=20 > I have observed that sometimes my index size temporarily increases by = a > large amount, presumably while it it merges segments. >=20 > Is there some documentation on this subject? I am trying to estimate = total > disk space I'll need for a project. >=20 > Thanks >=20 > Chris >=20 >=20 >=20 >=20 > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org