lucene-solr-user mailing list archives

From Erick Erickson <erickerick...@gmail.com>
Subject Re: SolrCloud performance issues regarding hardware configuration
Date Fri, 18 Jul 2014 15:10:29 GMT
Right, this is the worst kind of use-case for faceting. You have
150M docs/shard and are asking Solr to count into up to 125M buckets,
plus control structures. Even without OOMs, performance will be a
problem, and running several of these queries simultaneously will
increase memory usage further.
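
To put rough numbers on that, here is a back-of-envelope sketch. It assumes one 4-byte counter per bucket and that each concurrent request holds its own set of counters. Both are illustrative assumptions, not a description of Solr's actual faceting internals, and the control structures are ignored, so treat the result as a floor rather than a measurement. The function names are hypothetical.

```python
# Back-of-envelope estimate only. The per-bucket cost is an assumption,
# not a measurement of Solr's real faceting data structures.

BYTES_PER_COUNTER = 4  # assumed: one 32-bit count per bucket


def facet_counter_bytes(buckets, concurrent_queries=1):
    """Lower bound on memory used by per-bucket counters alone."""
    return buckets * BYTES_PER_COUNTER * concurrent_queries


def unique_value_range(total_docs, low_fraction, high_fraction):
    """Range of unique values given a fraction range of unique docs."""
    return round(total_docs * low_fraction), round(total_docs * high_fraction)


if __name__ == "__main__":
    # Figures from this thread: 700M docs, 95-97% unique, up to 125M buckets.
    low, high = unique_value_range(700_000_000, 0.95, 0.97)
    print(f"unique values in the collection: {low:,} to {high:,}")
    for queries in (1, 4):
        megabytes = facet_counter_bytes(125_000_000, queries) / 1_000_000
        print(f"{queries} concurrent request(s): at least {megabytes:,.0f} MB per shard")
```

Under these assumptions the counters alone come to hundreds of megabytes per request per shard, before any per-term structures are counted.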

So your choices are either to increase memory (a lot) or not do this.
It's a valid question whether this is useful information to present
to a user (or are you doing some kind of analytics here?).

This feels like an XY problem. What is the use-case you're trying to
support with this?

Best,
Erick



On Fri, Jul 18, 2014 at 5:04 AM, search engn dev <sachinyadav0025@gmail.com>
wrote:

> Out of 700 million documents, approximately 95-97% of the values are unique.
>
> My facet query is :
>
> http://localhost:8983/solr/select?q=*:*&rows=0&facet=true&facet.limit=10000&facet.field=user_digest
>
> The above query throws an OOM exception as soon as I fire it at Solr.
