lucene-dev mailing list archives

From "Yonik Seeley (JIRA)" <>
Subject [jira] [Commented] (SOLR-2564) Integrating grouping module into Solr 4.0
Date Wed, 08 Jun 2011 17:23:59 GMT


Yonik Seeley commented on SOLR-2564:

bq. Another option is to allow the grouping module (separately from Lucene core) to use
Java 6 code


bq. Yonik, how do you create the index used for this test? Somehow you generate an int field
w/ random 1000 unique values – do you have a client-side script you use to create random
docs in Solr?

I have some CSV files lying around that I reuse for ad-hoc testing of a lot of stuff.  They
were created with a simple Python script.
Then I simply index with:
curl "$URL/update/csv?stream.url=file:/tmp/test.csv&overwrite=false&commit=true"
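
For anyone who wants to reproduce this, here is a minimal sketch of what such a generator
could look like. It is not the actual script: the field names "id" and "val_i", the
function name write_test_csv, and the default document count are assumptions chosen for
illustration, and the target schema is assumed to accept an int field under that name.

```python
import csv
import random


def write_test_csv(out, num_docs=10000, num_unique=1000, seed=42):
    """Write a header row plus num_docs rows to the file-like object `out`.

    Each row has a sequential id and a random int in [0, num_unique).
    The column names are hypothetical; rename them to match your schema.
    """
    rng = random.Random(seed)  # fixed seed so repeated runs give the same file
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["id", "val_i"])  # first line: field names
    for doc_id in range(num_docs):
        writer.writerow([doc_id, rng.randrange(num_unique)])
```

Calling write_test_csv on a file opened for writing at /tmp/test.csv would produce a file
at the path the curl command above streams in. With many more documents than unique
values, the int field is likely (though not guaranteed) to take on all 1000 values.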

It was also my first reaction to think that this is a very synthetic case that people are
unlikely to hit... until I thought about dates.  Indexing everything in date order is a pretty
common thing to do, and so is sorting by date - which hits the exact same case.  Queries of
*:* and simple filter queries on type, etc., also tend to be pretty common (i.e. full-text
relevance/performance actually isn't an important feature for some users).

How complex must queries be for caching to generate a net benefit under load? I haven't tried
to test this myself.

> Integrating grouping module into Solr 4.0
> -----------------------------------------
>                 Key: SOLR-2564
>                 URL:
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Martijn van Groningen
>            Assignee: Martijn van Groningen
>             Fix For: 4.0
>         Attachments: LUCENE-2564.patch, SOLR-2564.patch, SOLR-2564.patch, SOLR-2564.patch, SOLR-2564.patch, SOLR-2564.patch
> Since work on the grouping module is going well, I think it is time to wire this up in Solr.
> Besides the current grouping features Solr provides, Solr will then also support second
> pass caching and total count based on groups.

This message is automatically generated by JIRA.