lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Zheng Lin Edwin Yeo <edwinye...@gmail.com>
Subject Re: Solr performance is slow with just 1GB of data indexed
Date Wed, 26 Aug 2015 14:58:54 GMT
Thanks for your recommendation Toke.

Will try to ask in the carrot forum.

Regards,
Edwin

On 26 August 2015 at 18:45, Toke Eskildsen <te@statsbiblioteket.dk> wrote:

> On Wed, 2015-08-26 at 15:47 +0800, Zheng Lin Edwin Yeo wrote:
>
> > Now I've tried to increase the carrot.fragSize to 75 and
> > carrot.summarySnippets to 2, and set the carrot.produceSummary to
> > true. With this setting, I'm mostly able to get the cluster results
> > back within 2 to 3 seconds when I set rows=200. I'm still trying out
> > to see if the cluster labels are ok, but in theory do you think this
> > is a suitable setting to attempt to improve the clustering results and
> > at the same time improve the performance?
>
> I don't know - the quality/performance point as well as which knobs to
> tweak is extremely dependent on your corpus and your hardware. A person
> with better understanding of carrot might be able to do better sanity
> checking, but I am not at all at that level.
>
> Related, it seems to me that the question of how to tweak the clustering
> has little to do with Solr and a lot to do with carrot (assuming here
> that carrot is the bottleneck). You might have more success asking in a
> carrot forum?
>
>
> - Toke Eskildsen, State and University Library, Denmark
>
>
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message