Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 2B401200CA6 for ; Tue, 13 Jun 2017 19:12:53 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 29AAB160BDC; Tue, 13 Jun 2017 17:12:53 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 6FFFE160BC5 for ; Tue, 13 Jun 2017 19:12:52 +0200 (CEST) Received: (qmail 72780 invoked by uid 500); 13 Jun 2017 17:12:50 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 72768 invoked by uid 99); 13 Jun 2017 17:12:50 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 13 Jun 2017 17:12:50 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 0DA0CC047C for ; Tue, 13 Jun 2017 17:12:50 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.666 X-Spam-Level: X-Spam-Status: No, score=0.666 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-2.796, SPF_PASS=-0.001, URI_HEX=1.313] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id cmpAeBrqCDw6 for ; Tue, 13 Jun 2017 17:12:48 +0000 (UTC) Received: from mail-wr0-f171.google.com (mail-wr0-f171.google.com [209.85.128.171]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id EE0D75F523 for ; Tue, 13 Jun 2017 17:12:47 +0000 (UTC) Received: by mail-wr0-f171.google.com with SMTP id 77so5384668wrb.1 for ; Tue, 13 Jun 2017 10:12:47 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=iUhf371T5JI2YhjwevgvuAOhru+FmN6MTDizG0XqfiA=; b=GnUTP94zPh/l7tFgu1OiidiacHbBkzslJM05alvRQ92cdEQm3yauXzcBYps1RFGMMt Fg3Fs7AFsQZ9Cj2Ca806JJrKJKi4k4GHreIlytlBje6n3bvKpRkbyUxw3Wg/wI/gTHT4 nbaBVOHXEMF8LnMJESA6myYZKFsVvFwMmgdlDko9a1ohxMgpj7woSa4ghykpInGNA+T1 mJOW6GeADjhgo2W3+PoNK7uIGr8OpBQn4DRuQIumhsruNpRSTBLCvXHi9byOqP2UBdub LOVDKVZIWyN2mfpyVeO2Rh0BKRMwEZUWYJx9tBEOdTpQlYJHe/o/iEXKnnoOcGvkr2L/ 94Iw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=iUhf371T5JI2YhjwevgvuAOhru+FmN6MTDizG0XqfiA=; b=WyRLMdPFijbIcC96Wx021edLsrYBr1M3EL/NE1bVKpOpnzoTGLxuIngxRFb+UYfYiu NyD+IRBrcaag15de+mgcfCt2PGpq5PVkznA2FXt0kvr7KK0ymhdFp388FKyW5Uv5O0ia CQ1IXSt7UBA9BzGPcnmzui3/Je8vCe1Gi98NzG7He8W2hC5M6adnOqjdb8SNDzK+t0Fm 1jFsb1SjfDYFHHLpxngUU4b+L26qc36AKqUaiHETFr7S92RTralCFJr8wRmgQ6x2Koby IFYkJ/scz9XbZMiwHz5SoREo4sSgQCdiqGKetXPPfaMvj7ltAzdxk8Cm4WfofHtSbuBU M+6g== X-Gm-Message-State: AKS2vOz/lPEBbjfSWSXFCx16ohZ/M7OFqlB0Rd7jzteOENbn2OEEbgc5 mDWOZIsA+cxvEwWL3gSBdySn1p+kVQ== X-Received: by 10.80.139.145 with SMTP id m17mr716022edm.121.1497373967587; Tue, 13 Jun 2017 10:12:47 -0700 (PDT) MIME-Version: 1.0 Received: by 10.80.179.58 with HTTP; Tue, 13 Jun 2017 10:12:47 -0700 (PDT) In-Reply-To: <1497371709549-4340377.post@n3.nabble.com> References: <1497371709549-4340377.post@n3.nabble.com> From: Susheel Kumar Date: Tue, 13 Jun 2017 13:12:47 -0400 Message-ID: Subject: Re: Multi tenant setup To: solr-user@lucene.apache.org Content-Type: multipart/alternative; boundary="f403045c19906f21700551da8bf5" archived-at: Tue, 13 Jun 2017 17:12:53 -0000 --f403045c19906f21700551da8bf5 Content-Type: text/plain; charset="UTF-8" Going with single cluster having multiple collections (for each client) is what I would try. How many clients do you have? If 10K, mean 10K collections and then how many documents, their size etc. you will need to come up with to nail down #machines and their memory/cpu requirements. Going with single collection is not really a multi-tenant setup and also when you have different schema's. Thanks, Susheel On Tue, Jun 13, 2017 at 12:35 PM, Zisis T. wrote: > I'm trying to setup a multi-tenant Solr cluster (v6.5.1) which must meet > the > following requirements. The tenants are different customers with similar > type of data. > > * Ability to query per client but also across all clients > * Don't want to hit all shards for all type of requests (per client, across > clients) > * Don't want to have everything under a single multi-sharded collection to > avoid a SPOF and maintenance headaches > (e.g. a schema change will force an all-client reindexing. single huge > backup/restore) > * Ability to semi-support different schemas. > > Based on the above I ruled out the following setups > * Single multi-sharded collection for all clients and all its variations > (e.g. multiple clients in a singe shard) > * One collection per client > > My preference lies in a setup like the following > * Create a limited # of collections > * Split the clients in the collections created above based on some criteria > (size, content-type) > * Client specific requests will be limited in a single collection > * Across clients requests will target a limited # of collections (using > &collection=col_1,col_2,col_3) > > The approach above meets the requirements posted above but the issue that > is > blocking me is the Distributed IDF not working properly across collections. > (Check comment#3, bullet#2 of > http://lucene.472066.n3.nabble.com/Distributed-IDF-in- > inter-collections-distributed-queries-td4317519.html) > > > -> Do you see anything wrong with my assumptions/approach above? Are there > any alternatives besides having separate clusters for the search across > clients and the individual clients? > -> Is it safe to go with a single collection? If it is, I still need to > handle the possible different schemas per client somehow. > -> Is there a way to enforce local stats when quering a single collection > and use global stats only when querying across collections? (see link > above) > > Thanks > > > > -- > View this message in context: http://lucene.472066.n3. > nabble.com/Multi-tenant-setup-tp4340377.html > Sent from the Solr - User mailing list archive at Nabble.com. > --f403045c19906f21700551da8bf5--