Return-Path: X-Original-To: apmail-lucene-general-archive@www.apache.org Delivered-To: apmail-lucene-general-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8866018A1E for ; Mon, 23 Nov 2015 20:24:07 +0000 (UTC) Received: (qmail 11616 invoked by uid 500); 23 Nov 2015 20:24:06 -0000 Delivered-To: apmail-lucene-general-archive@lucene.apache.org Received: (qmail 11563 invoked by uid 500); 23 Nov 2015 20:24:06 -0000 Mailing-List: contact general-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@lucene.apache.org Delivered-To: mailing list general@lucene.apache.org Received: (qmail 11551 invoked by uid 99); 23 Nov 2015 20:24:06 -0000 Received: from Unknown (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 23 Nov 2015 20:24:06 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id D652AC0DD5 for ; Mon, 23 Nov 2015 20:24:05 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.731 X-Spam-Level: X-Spam-Status: No, score=0.731 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, LOCALPART_IN_SUBJECT=0.73, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=playingwithpointers-com.20150623.gappssmtp.com Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id qkTDsmFA72Nb for ; Mon, 23 Nov 2015 20:23:50 +0000 (UTC) Received: from mail-pa0-f41.google.com (mail-pa0-f41.google.com [209.85.220.41]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id 6B74620FE7 for ; Mon, 23 Nov 2015 20:23:49 +0000 (UTC) Received: by pacej9 with SMTP id ej9so201383884pac.2 for ; Mon, 23 Nov 2015 12:23:48 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=playingwithpointers-com.20150623.gappssmtp.com; s=20150623; h=message-id:date:from:user-agent:mime-version:to:cc:subject :references:in-reply-to:content-type:content-transfer-encoding; bh=awPNRjhUeCdpBLrcV3lE91O35TyurYZJ0aZLjv3Pu44=; b=add97TcF2ece/ldLHjHj+5QGS0ijnHiVXJ1q8b6vpGNO1I+YV0V7rP/xV5JMc5Fvp1 4qMGReQTw+lzsecBGZdIXQuC4nyTrJom8g1scbtkMHS/LiY8nUlfv2uJAQNFEkqBx+9H xOsLDckwRQqXCUo7Fm45X4GG/vaRkkO+eiWvjokUCyDWK3OG+sIA4PMLt1GMDDMCHkjC 5uBGBtl6jHDsNnebKacdvOvbns3fwMSOIJfAaHRy7Jfh/9FBnNeqmYC8eDhOXQq3tQSm 5dZfrXOaP6tQQz1tTcZWaicZZqCHlahNjUYZSYArZUT7xMUzSbc0lv5J7X9djgE13lFl sZ1w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:message-id:date:from:user-agent:mime-version:to :cc:subject:references:in-reply-to:content-type :content-transfer-encoding; bh=awPNRjhUeCdpBLrcV3lE91O35TyurYZJ0aZLjv3Pu44=; b=i8N1fqko6/W+koA2tvod6Z3dZaBWp38gw16uQHSfnsG1ipoSZq3SJa0vXzYT10kdR5 XEswy0lNxp8Pg5Lule3EHDsgv5BKVyDOy+KfWiY7ExwxGZA9THJeujauHy32lvu8iAf5 XaRU+VgMzYIkA7bJ/iW9no2TyBd/7TK4zJr4OQCHO44AZ1U2PqDWBvFNt4RlkxqA4ncD OGpoj/qZS6jczFRck2H3qqOp663wDrJkU9ExMTVkdUpiNTn39SkEiQDCmPvoUzjsbQO2 Y4Q0lbBuuep+oJeF5cMg6YCsfCvZIQB9ZlD8xVbindRQGjseM2mcmzU7W+0LqDtVmkno SZWg== X-Gm-Message-State: ALoCoQlibYa76kVatWxFiDOi2E3+04NGYzJOFp3kO4pz1vOqwNSz25x3YLaxn29RjHUmJH6E7qtH X-Received: by 10.98.9.194 with SMTP id 63mr18186835pfj.30.1448310228042; Mon, 23 Nov 2015 12:23:48 -0800 (PST) Received: from [10.10.7.252] (scooby.azul.com. [173.228.87.163]) by smtp.googlemail.com with ESMTPSA id iv5sm11480522pbb.24.2015.11.23.12.23.45 (version=TLSv1/SSLv3 cipher=OTHER); Mon, 23 Nov 2015 12:23:46 -0800 (PST) Message-ID: <565375D0.7000304@playingwithpointers.com> Date: Mon, 23 Nov 2015 12:23:44 -0800 From: Sanjoy Das User-Agent: Postbox 4.0.8 (Macintosh/20151105) MIME-Version: 1.0 To: Michael McCandless CC: Lucene/Solr dev , "general@lucene.apache.org" Subject: Re: Benchmarking Lucene References: <56536C43.2010301@playingwithpointers.com> In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit Michael McCandless wrote: > Which JVM vendor :) There are not so many, unfortunately... I work for Azul Systems (https://www.azul.com). > I run nightly benchmarks for Lucene, which are visible at > https://people.apache.org/~mikemccand/lucenebench/ > > We use this to catch accidental performance regressions... the sources > for all of this are at https://github.com/mikemccand/luceneutil but > running them yourself can be tricky. They index and search > Wikipedia's English export. I was hoping to get hold of benchmarks that are a little more "lightweight" -- something that I can run from beginning to end in < 30 minutes. Is there an interesting subset of the nightly tests that I can run within that sort of timeframe? > Lucene is definitely JVM/GC bound in many cases, e.g. when the index > is "hot" (fully cached by the OS in free RAM). > > I'm not familiar with Dacapo... > > I'm not sure how aggressively users upgrade ... but I believe most > users use Lucene via Elasticsearch or Solr. > > Mike McCandless > > http://blog.mikemccandless.com > > > On Mon, Nov 23, 2015 at 2:42 PM, Sanjoy Das > wrote: >> Hi all, >> >> I work for a JVM vendor, and we're interested in obtaining / creating >> a set of Lucene benchmarks for internal use. We plan to use these for >> performance regression testing and general performance analysis >> (i.e. to make sure Lucene performs well on our JVM). I'm especially >> interested in benchmarks that demonstrate opportunities for >> improvements in our JIT compiler. >> >> While I imagine that the lucene/benchmark/ directory is probably the >> right place to start, I have a few high-level questions that are best >> answered by people on this mailing list: >> >> - Are there realistic Lucene workloads that are bottle-necked on the >> JVM's performance (JIT, GC etc.) and *not* e.g. disk / network IO? >> If so, what are some examples? >> >> - How relevant are the Dacapo "luindex" and "lusearch" benchmarks >> today? Will porting them to the latest version of Lucene give me a >> benchmark representative of modern Lucene usage, or has Lucene's >> performance characteristics evolved in fundamental ways since Dacapo >> was published? >> >> - What is the distribution of Lucene versions in production >> deployments? Do users tend to aggressively upgrade to the "latest >> and greatest" Lucene version, or is there usually a non-trivial lag? >> >> Any other information that you think is useful or relevant is >> welcome. >> >> Thanks! >> -- Sanjoy >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org >> For additional commands, e-mail: dev-help@lucene.apache.org >>