From "Ishan Chattopadhyaya (JIRA)" <>
Subject [jira] [Commented] (SOLR-10317) Solr Nightly Benchmarks
Date Wed, 29 Mar 2017 16:42:41 GMT


Ishan Chattopadhyaya commented on SOLR-10317:

Here's a rough list off the top of my head. For the sake of completeness, it would be good
for a student to add whatever I've missed to this list:
# Indexing benchmarks
## Standalone
## SolrCloud (various simple configurations (0))
## New replication mode (SOLR-9835)
# Various types of queries:
## Querying on numeric fields (exact queries, range queries)
## Querying on text fields
## Querying on string fields
## Sorting on numeric fields, string fields (with and without docValues)
## Extended Dismax queries
## Spatial search (using various strategies)
# Query (all the above) on
## Standalone Solr
## SolrCloud (on some simple configurations (0))
## Also, good if this can be tried out on the new replication mode (SOLR-9835).
# Partial Updates benchmarks (atomic updates, in-place updates)
# Faceting (string fields, numeric fields, enum fields)
# Grouping (string fields, numeric fields, enum fields)
# Spell check
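
As a rough illustration of how some of the query categories above could be expressed and timed, here is a minimal Python sketch. The request parameters (q, sort, defType, qf, facet, facet.field, group, group.field, spellcheck) are standard Solr parameters, but everything else is an assumption: the field names (views_i, body_t, title_s, title_t, category_s), the collection name, and the base URL are hypothetical placeholders, and the timing helper is just one simple way to take a measurement. Spatial search is left out because the right parameters depend on the strategy being tested, and spell check only works if the request handler has a spellcheck component configured.

```python
import time
from urllib.parse import urlencode

# Hypothetical field names; substitute the fields of the benchmark schema.
QUERY_PARAMS = {
    "numeric_exact": {"q": "views_i:42"},
    "numeric_range": {"q": "views_i:[10 TO 100]"},
    "text": {"q": "body_t:apache"},
    "string": {"q": 'title_s:"Apache Solr"'},
    "sort_numeric": {"q": "*:*", "sort": "views_i asc"},
    "edismax": {"q": "apache solr", "defType": "edismax", "qf": "title_t body_t"},
    "facet_string": {"q": "*:*", "facet": "true", "facet.field": "category_s"},
    "group_string": {"q": "*:*", "group": "true", "group.field": "category_s"},
    "spellcheck": {"q": "apachee", "spellcheck": "true"},
}


def select_url(base_url, collection, params):
    """Build a /select request URL for one benchmark query."""
    return f"{base_url}/{collection}/select?{urlencode(params)}"


def time_call(fn, repeats=5):
    """Run fn() `repeats` times and return the median wall-clock seconds."""
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        samples.append(time.perf_counter() - start)
    samples.sort()
    return samples[len(samples) // 2]
```

In a real run, the function passed to time_call would issue the HTTP request for select_url(...) against a running Solr instance; the same set of queries can then be replayed against standalone Solr and each SolrCloud configuration.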

A Wikipedia-based dataset is usually available on all the Jenkins instances, and could be
used for this purpose. [~steve_rowe], [~thetaphi], can you please point to the download
link for the enwiki.random.lines.txt file? (I have it, but forgot where I got it from.)

If I've missed something, please feel free to comment.

(0) - Some simple SolrCloud configurations could be:
# 1 shard, 2-3 replicas
# 2 shards, 1 replica each
# 2 shards, 2 replicas each
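
The configurations in (0) could be written down as (numShards, replicationFactor) pairs and turned into Collections API CREATE requests, roughly as in the Python sketch below. The action, name, numShards, and replicationFactor parameters are standard Collections API parameters; the base URL and the bench_NxM collection names are hypothetical, "2-3 replicas" is expanded into two separate layouts, and a real setup would likely need further parameters (for example, a configset name).

```python
from urllib.parse import urlencode

# (numShards, replicationFactor) pairs taken from the list above;
# "1 shard, 2-3 replicas" is expanded into two layouts.
LAYOUTS = [(1, 2), (1, 3), (2, 1), (2, 2)]


def create_collection_url(base_url, name, num_shards, replication_factor):
    """Build a Collections API CREATE URL for one benchmark layout."""
    params = {
        "action": "CREATE",
        "name": name,
        "numShards": num_shards,
        "replicationFactor": replication_factor,
    }
    return f"{base_url}/admin/collections?{urlencode(params)}"


for shards, replicas in LAYOUTS:
    # Hypothetical naming scheme: bench_<shards>x<replicas>
    name = f"bench_{shards}x{replicas}"
    print(create_collection_url("http://localhost:8983/solr", name, shards, replicas))
```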

