Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0371611D27 for ; Mon, 4 Aug 2014 12:52:52 +0000 (UTC) Received: (qmail 23192 invoked by uid 500); 4 Aug 2014 12:52:48 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 23121 invoked by uid 500); 4 Aug 2014 12:52:48 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 23107 invoked by uid 99); 4 Aug 2014 12:52:48 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 04 Aug 2014 12:52:48 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of yseeley@gmail.com designates 209.85.214.169 as permitted sender) Received: from [209.85.214.169] (HELO mail-ob0-f169.google.com) (209.85.214.169) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 04 Aug 2014 12:52:44 +0000 Received: by mail-ob0-f169.google.com with SMTP id nu7so4470442obb.28 for ; Mon, 04 Aug 2014 05:52:21 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date:message-id:subject :from:to:content-type; bh=paIXtXzm/A5/Ag6oe1ogEGSNXqkNgiNP40+jlSVrF4k=; b=QHol5jJpc+3eI/Xr9AjFm3k1uCbJjD4H1V2PXNYObWgwswqWLUghKbw4Ixb2es1T08 GYTcNgpnPc4aczeaF2IDbK/HZdo8mQbWnpG2HwSmRD1fuI8LSxr1FSQRvso4Z1YA99i3 AsK9EIfWA/sKc27hL5Rd2jmkCVlhUciyswgNvxHTPdsCnwPBIlwOtIKDK9mhk83QL9Iq WAdDjJLmt4s6crpEODznJcDpqtp0/45c55XXmp5I3z5omjSFqHi0og22PYzEw1dE7gWe 9Acfkeg8ouIeC1EmgMtrOVFJKDk6VUBVwzEgrFcbfHoXjcE7fNbI30hek6IS1yH5e7gb 0qnA== MIME-Version: 1.0 X-Received: by 10.60.179.114 with SMTP id df18mr4322436oec.76.1407156741097; Mon, 04 Aug 2014 05:52:21 -0700 (PDT) Sender: yseeley@gmail.com Received: by 10.182.152.98 with HTTP; Mon, 4 Aug 2014 05:52:21 -0700 (PDT) In-Reply-To: References: <4DE5A4E6.8030302@gmail.com> <1311690781414-3200492.post@n3.nabble.com> <53DF28D9.7050401@raytion.com> Date: Mon, 4 Aug 2014 08:52:21 -0400 X-Google-Sender-Auth: 7q3n8Vx6PD350vOEV7Jsgv6KGDY Message-ID: Subject: Re: Solr vs ElasticSearch From: Yonik Seeley To: "solr-user@lucene.apache.org" Content-Type: text/plain; charset=UTF-8 X-Virus-Checked: Checked by ClamAV on apache.org On Mon, Aug 4, 2014 at 2:43 AM, Alexandre Rafalovitch wrote: > That resource is rather superficial. I wouldn't make big decision based on it. Agree. It's also somewhat biased given the environment in which it grew. ES advocates were all over stuff like that, but Solr advocates were less vocal. Qualitatively: Solr facets and function queries were faster for ages (no idea if they still are or not...). Solr's faceting took up far less memory (that's probably changed too)... but no mention. Solr had efficient deep paging first, but most assume it was the other way around: https://github.com/elasticsearch/elasticsearch/issues/4940 Solr's "function queries" were far faster - I evaluated the mvel scripting language used by ES for this stuff... it was dog slow. Some something more concrete: Solr's faceting gives exact counts for the constraints returned, while ES still does not (it still does a naive "sum top N from each shard".) Some things in the table are just wrong: - Under "joins" for Solr, it says "It's not supported in distributed search.", yet ES has the exact same limitations... joined docs must be on the same shard (and provided that is true, joins are both supported in Solr and ES). - The comment for "Negative Boosting" is just wrong. It is supported. - "Online schema changes" is incorrect for Solr - it is supported. - "Structured Query DSL"... yes, we've had it forever. No it's not JSON. - "Advanced Faceting" is simply a "no" under solr and a "yes" under ES... this is incorrect. The tooltip says "metrics and bucketing", which solr has had forever (facet stats) that tons of people have used to build BI tools. Heliosearch adds even more of course. There are probably things wrong on the ES side too of course. But then at the bottom some of the things in "Thoughts..." are unfair and biasing... """As Matt Weber points out below, ElasticSearch was built to be distributed from the ground up, not tacked on as an 'afterthought' like it was with Solr. This is totally evident when examining the design and architecture of the 2 products, and also when browsing the source code.""" That's from a well known ES advocate of course. But software, just like arguments, should be evaluated in it's merits. -Yonik http://heliosearch.org - native code faceting, facet functions, sub-facets, off-heap data