Return-Path: X-Original-To: apmail-lucene-dev-archive@www.apache.org Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 02EA317AFF for ; Sat, 14 Mar 2015 11:39:40 +0000 (UTC) Received: (qmail 20426 invoked by uid 500); 14 Mar 2015 11:39:39 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 20364 invoked by uid 500); 14 Mar 2015 11:39:39 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 20353 invoked by uid 99); 14 Mar 2015 11:39:38 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 14 Mar 2015 11:39:38 +0000 Date: Sat, 14 Mar 2015 11:39:38 +0000 (UTC) From: "Damien Kamerman (JIRA)" To: dev@lucene.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (SOLR-7191) Improve stability and startup performance of SolrCloud with thousands of collections MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/SOLR-7191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Damien Kamerman updated SOLR-7191: ---------------------------------- Attachment: SOLR-7191.patch This fix takes some load off the DistributedQueue. On shutdown the node will publish a 'down' state for the entire node and overseer will do the rest. Registration is done in a fixed size (128) threadPool in corename order. > Improve stability and startup performance of SolrCloud with thousands of collections > ------------------------------------------------------------------------------------ > > Key: SOLR-7191 > URL: https://issues.apache.org/jira/browse/SOLR-7191 > Project: Solr > Issue Type: Bug > Components: SolrCloud > Affects Versions: 5.0 > Reporter: Shawn Heisey > Labels: performance, scalability > Attachments: SOLR-7191.patch, SOLR-7191.patch, lots-of-zkstatereader-updates-branch_5x.log > > > A user on the mailing list with thousands of collections (5000 on 4.10.3, 4000 on 5.0) is having severe problems with getting Solr to restart. > I tried as hard as I could to duplicate the user setup, but I ran into many problems myself even before I was able to get 4000 collections created on a 5.0 example cloud setup. Restarting Solr takes a very long time, and it is not very stable once it's up and running. > This kind of setup is very much pushing the envelope on SolrCloud performance and scalability. It doesn't help that I'm running both Solr nodes on one machine (I started with 'bin/solr -e cloud') and that ZK is embedded. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org