Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 42E11200B8A for ; Sat, 24 Sep 2016 19:48:56 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 41811160AD1; Sat, 24 Sep 2016 17:48:56 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 7D317160AC2 for ; Sat, 24 Sep 2016 19:48:55 +0200 (CEST) Received: (qmail 45672 invoked by uid 500); 24 Sep 2016 17:48:53 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 45661 invoked by uid 99); 24 Sep 2016 17:48:53 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 24 Sep 2016 17:48:53 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id E21F01A0467 for ; Sat, 24 Sep 2016 17:48:52 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.426 X-Spam-Level: X-Spam-Status: No, score=-0.426 tagged_above=-999 required=6.31 tests=[KAM_LAZY_DOMAIN_SECURITY=1, RCVD_IN_DNSWL_NONE=-0.0001, RP_MATCHES_RCVD=-1.426] autolearn=disabled Received: from mx2-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id ifxLcY-cZ4kA for ; Sat, 24 Sep 2016 17:48:51 +0000 (UTC) Received: from sbexch04.sb.statsbiblioteket.dk (sbexch04.sb.statsbiblioteket.dk [130.225.24.70]) by mx2-lw-eu.apache.org (ASF Mail Server at mx2-lw-eu.apache.org) with ESMTPS id 41F525F2F2 for ; Sat, 24 Sep 2016 17:48:51 +0000 (UTC) Received: from sbexch04.sb.statsbiblioteket.dk (130.225.24.70) by sbexch04.sb.statsbiblioteket.dk (130.225.24.70) with Microsoft SMTP Server (TLS) id 15.0.1210.3; Sat, 24 Sep 2016 19:48:50 +0200 Received: from sbexch04.sb.statsbiblioteket.dk ([fe80::84ce:82da:4b03:e7d4]) by sbexch04.sb.statsbiblioteket.dk ([fe80::84ce:82da:4b03:e7d4%14]) with mapi id 15.00.1210.000; Sat, 24 Sep 2016 19:48:50 +0200 From: Toke Eskildsen To: solr_user lucene_apache Subject: Re: Whether SolrCloud can support 2 TB data? Thread-Topic: Whether SolrCloud can support 2 TB data? Thread-Index: AQHSFd1SXziuzYmuxE+aZ5u6US69pKCHrE0v///zuACAAHIBAIAAfOOAgABUEGs= Date: Sat, 24 Sep 2016 17:48:50 +0000 Message-ID: <1474739330118.39784@statsbiblioteket.dk> References: <1474662805089-4297790.post@n3.nabble.com> <1474664334795-4297800.post@n3.nabble.com> <27d64747-4417-4c35-8c4e-d8028060a4fa@Spark> , In-Reply-To: Accept-Language: en-GB, da-DK, en-US Content-Language: en-GB X-MS-Has-Attach: X-MS-TNEF-Correlator: x-ms-exchange-transport-fromentityheader: Hosted x-originating-ip: [188.183.66.166] Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 archived-at: Sat, 24 Sep 2016 17:48:56 -0000 Regarding a 12TB index:=0A= =0A= Yago Riveiro wrote:=0A= =0A= > Our cluster is small for the data we hold (12 machines with SSD and 32G o= f=0A= > RAM), but we don't need sub-second queries, we need facet with high=0A= > cardinality (in worst case scenarios we aggregate 5M unique string values= )=0A= =0A= > In a peak of inserts we can handle around 25K docs per second without any= =0A= > issue with 2 replicas and without compromise reads or put a node in stres= s.=0A= > Nodes in stress can eject him selfs from the Zookepeer cluster due a GC o= r a=0A= > lack of CPU to communicate.=0A= =0A= I am surprised that you manage to have this working on that hardware. As yo= u have replicas, it seems to me that you handle 2*12TB of index with 12*32G= B of RAM? This is very close to our setup (22TB of index with 320GB of RAM = (updated last week from 256GB) per machine), but we benefit hugely from hav= ing a static index.=0A= =0A= I assume the SSDs are local? How much memory do you use for heap on each ma= chine?=0A= =0A= - Toke Eskildsen=0A=