lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Scott Schneider <Scott_Schnei...@symantec.com>
Subject RE: Performance testing Lucene
Date Fri, 24 Jan 2014 01:40:54 GMT
Thanks!  I ran this Directory subclass through the Lucene unit tests (and found 3 race conditions).
 Unit tests are wonderful.

Scott
 

> -----Original Message-----
> From: Michael McCandless [mailto:lucene@mikemccandless.com]
> Sent: Wednesday, January 22, 2014 7:05 AM
> To: Lucene Users
> Subject: Re: Performance testing Lucene
> 
> All the source code for the nightly Lucene perf tests I run (
> http://people.apache.org/~mikemccand/lucenebench/ ) are here:
> https://code.google.com/a/apache-extras.org/p/luceneutil/
> 
> These are also the scripts I use for A/B performance tests for a new
> patch.
> 
> It's somewhat tricky getting those Python scripts set up to run ...
> but I think it'd be a good way to smoke test your new Directory.
> 
> The queries are "synthetic"; it's a real problem, not having a real
> world, biggish corpus plus real queries, for better performance
> testing...
> 
> Mike McCandless
> 
> http://blog.mikemccandless.com
> 
> 
> On Mon, Jan 20, 2014 at 11:22 PM, Scott Schneider
> <Scott_Schneider@symantec.com> wrote:
> > Hello,
> >
> > Would you folks mind giving me a few tips on performance testing
> Lucene?  I want to test the performance impact of a Directory subclass.
> >
> > What is a good testing tool to use?  I don't see a great way to get
> SolrMeter to run the max # updates/minute and measure throughput that
> way.  When I set the # updates/minute to a large #, SolrMeter logs
> NullPointerExceptions.  (I assume these are within SolrMeter, as I
> don't see errors in Solr.)  Mike McCandless's nightly Lucene
> performance tests look good, though I've only just started looking at
> it.
> >
> > Are there any particularly standard or good test sets?  I'd like to
> test 3 scenarios:  indexing only, querying only, and indexing plus
> querying.  McCandless's indexing test uses wikipedia, which seems
> great, but he has a slew of tests that are each specific to some
> querying feature.  I'd like a single, general query test.  It's not
> hard to come up with a decent set of queries, but I'd really like
> something representative of real world queries.  If there some standard
> set of commonly used queries, that would be ideal.
> >
> > Thanks!
> >
> > Scott
> >
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message