Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id CE03C11A5C for ; Sun, 6 Jul 2014 01:48:03 +0000 (UTC) Received: (qmail 74602 invoked by uid 500); 6 Jul 2014 01:47:57 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 74519 invoked by uid 500); 6 Jul 2014 01:47:57 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 74507 invoked by uid 99); 6 Jul 2014 01:47:57 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 06 Jul 2014 01:47:57 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of kireet@feedly.com designates 209.85.192.173 as permitted sender) Received: from [209.85.192.173] (HELO mail-pd0-f173.google.com) (209.85.192.173) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 06 Jul 2014 01:47:50 +0000 Received: by mail-pd0-f173.google.com with SMTP id r10so3529106pdi.18 for ; Sat, 05 Jul 2014 18:47:30 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:content-type:content-transfer-encoding :subject:message-id:date:to:mime-version; bh=Oa1T4NrF/Q5PA6IPXibGbc+XGJ6+x5xHZz5YXrAWOJI=; b=Bv0rPsGkKRsiwsVX1dIZytpSsEL5w60uHplCrdOV6x3GaqtvhJRJN1U0Co5XE9NBUa WIyLOzvbiS+lv4iti9IBEbHwOoUaq9d61RPYyLK0sSp66X7ZErp1/bP/kX7y7azBcSc5 op7V4fTKbmouWPGZz2fuyFnMO+CZufFW2KUwRmU6Ta1PzWKrG40cQZ+NIDhGMKMn4QxT jDDVqVLsiF/99dmbhx0QnR+XxGBFxPinZmKYfi/PVSFHaRSnnW6UFO5vH2SgYpFW2wE/ 7ds0LdX8Wd3yy+vyjrmFzFDzvZ9rRmg54azkm/RmucQjyk3cYJaKQCDxJvSMau7yk4+B KrYg== X-Gm-Message-State: ALoCoQnC538ea5PwSNH3zH2tZN4Te2gSTxKnUvGVwEabup7m9L7yiBOQkrS/dIXBiLHDP5elbbb6 X-Received: by 10.66.163.164 with SMTP id yj4mr19749760pab.91.1404611250469; Sat, 05 Jul 2014 18:47:30 -0700 (PDT) Received: from [10.0.0.2] ([76.102.141.137]) by mx.google.com with ESMTPSA id f2sm173293831pat.11.2014.07.05.18.47.28 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Sat, 05 Jul 2014 18:47:29 -0700 (PDT) From: Kireet Reddy Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Subject: TieredMergePolicy Message-Id: <58943385-082A-470D-9BF2-86E256804766@feedly.com> Date: Sat, 5 Jul 2014 18:47:28 -0700 To: solr-user@lucene.apache.org Mime-Version: 1.0 (Mac OS X Mail 7.3 \(1878.2\)) X-Mailer: Apple Mail (2.1878.2) X-Virus-Checked: Checked by ClamAV on apache.org I have a question about the maxMergeAtOnce parameter. We are using = elasticsearch and one of our nodes seems to have very high merge = activity, However it seems to be high CPU activity and not I/O = constrainted. I have enabled the IndexWriter info stream logs, and often = times it seems to do merges of quite small segments (100KB) that are = much below the floor size (2MB). I suspect this is due to frequent = refreshes and/or using lots of threads concurrently to do indexing.=20 My supposition is that this is leading to the merge policy doing lots of = merges of very small segments into another small segment which will = again require a merge to even reach the floor size. My index has 64 = segments and 25 are below the floor size. I am wondering if there should = be an exception for the maxMergesAtOnce parameter for the first level so = that many small segments could be merged at once in this case? I am considering changing the other parameters (wider tiers, lower floor = size, more concurrent merges allowed) but these all seem to have side = effects I may not necessarily want. Is there a good solution here?=