Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6D336F1EA for ; Fri, 25 Apr 2014 16:38:01 +0000 (UTC) Received: (qmail 54326 invoked by uid 500); 25 Apr 2014 16:37:57 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 54258 invoked by uid 500); 25 Apr 2014 16:37:56 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 54250 invoked by uid 99); 25 Apr 2014 16:37:56 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 25 Apr 2014 16:37:56 +0000 X-ASF-Spam-Status: No, hits=1.7 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of mukh.007@gmail.com designates 209.85.216.175 as permitted sender) Received: from [209.85.216.175] (HELO mail-qc0-f175.google.com) (209.85.216.175) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 25 Apr 2014 16:37:51 +0000 Received: by mail-qc0-f175.google.com with SMTP id e16so4217515qcx.6 for ; Fri, 25 Apr 2014 09:37:30 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date:message-id:subject :from:to:content-type; bh=C9k4X1P5LS5F7bxf79+605PjiGZsTtjvBhuCr6J61zw=; b=DJsJlWeGmAgnREZqp9XrL/bSGaWq1vPazzZDbPY8jJ9FUdfdFNmFG3dz6IVOhLphlZ mGrcLiv3um9nkm14V2P4LO7Et596yEiUe1SyPFuAx/1GCJVnMyjjsGejhLGDRMUv8QbU YOTNEe/QoVdVeoYpctR27PdrAX0p9FkaoMuCqrY34xAZxDyHPkVfV+OuJjUgXKCxjISr wTn5a6Hi4lqbwCnM3L/6famJtoAlHSANjEzWNZAsIqaupPCFYyJtDuPUK44w49wp8XKH HvHsasqtIC9942DbsMEJ4rrgQGuVkM9JOd8dhLUtVexq9G2dLz5SiPKFZwjKWw1ao2q/ 7/OQ== MIME-Version: 1.0 X-Received: by 10.224.165.139 with SMTP id i11mr13175329qay.94.1398443850412; Fri, 25 Apr 2014 09:37:30 -0700 (PDT) Sender: mukh.007@gmail.com Received: by 10.224.119.81 with HTTP; Fri, 25 Apr 2014 09:37:30 -0700 (PDT) In-Reply-To: References: Date: Fri, 25 Apr 2014 22:07:30 +0530 X-Google-Sender-Auth: xyegNSe5Ha8WBH_w4_i5WVaUGxg Message-ID: Subject: Re: Solr Cluster management having too many cores From: Mukesh Jha To: solr-user@lucene.apache.org Content-Type: multipart/alternative; boundary=089e0149ce32f1362e04f7e094d4 X-Virus-Checked: Checked by ClamAV on apache.org --089e0149ce32f1362e04f7e094d4 Content-Type: text/plain; charset=ISO-8859-1 Thanks for quick reply Erik, I want to keep my collections till I run out of hardware, which is at least a couple of years worth data. I'd like to know more on ageing out aliases, did a quick search but didn't find much. On Fri, Apr 25, 2014 at 9:45 PM, Erick Erickson wrote: > Hmmm, tell us a little more about your use-case. In particular, how > long do you need to keep the data around? Days? Months? Years? > > Because if you only need to keep the data for a specified period, you > can use the collection aliasing process to age-out collections and > keep the number of cores from growing too large. > > Best, > Erick > > On Fri, Apr 25, 2014 at 6:49 AM, Mukesh Jha > wrote: > > Hi Experts, > > > > I need to divide my indexes based on hour/day with each index having > ~50-80 > > GB data & ~50-80 mill docs, so I'm planning to create daily collection > with > > names e.g. *sample_colledction_yyyy_mm_dd_hh.* > > I'll also create an alias *sample_collection* and update it whenever I > will > > create a new collection so that the entire data set is searchable. > > > > I've a couple of question on the above design > > 1) How far can it scale? As my collections will increase (so will the > > shards & replicas) do we have a breaking point when adding more/searching > > will become an issue? > > 2) As my cluster will grow because of huge number of collections the > > clusterstate.json file present in zookeeper will grow too, won't this be > a > > limiting factor? If so instead of storing all this info in one > > clusterstate.json file shouldn't Solr save cluster specific details in > this > > file & have collection specific config files present on zookeeper? > > 3) How can I easily manage all these collections? Do we have Java > Coreadmin > > API's available. I cannot find much documented on it. > > > > -- > > Txz, > > > > *Mukesh Jha * > -- Thanks & Regards, *Mukesh Jha * --089e0149ce32f1362e04f7e094d4--