lucene-java-user mailing list archives

From Michael McCandless <luc...@mikemccandless.com>
Subject Re: Performance testing Lucene
Date Fri, 24 Jan 2014 10:45:10 GMT
Oh that's good to hear.  Lucene's unit tests are quite stressful on a
new Directory impl...

Mike McCandless

http://blog.mikemccandless.com


On Thu, Jan 23, 2014 at 8:40 PM, Scott Schneider
<Scott_Schneider@symantec.com> wrote:
> Thanks!  I ran this Directory subclass through the Lucene unit tests
> (and found 3 race conditions).  Unit tests are wonderful.
>
> Scott
>
>
>> -----Original Message-----
>> From: Michael McCandless [mailto:lucene@mikemccandless.com]
>> Sent: Wednesday, January 22, 2014 7:05 AM
>> To: Lucene Users
>> Subject: Re: Performance testing Lucene
>>
>> All the source code for the nightly Lucene perf tests I run (
>> http://people.apache.org/~mikemccand/lucenebench/ ) is here:
>> https://code.google.com/a/apache-extras.org/p/luceneutil/
>>
>> These are also the scripts I use for A/B performance tests for a new
>> patch.
>>
>> It's somewhat tricky getting those Python scripts set up to run ...
>> but I think it'd be a good way to smoke test your new Directory.
>>
>> The queries are "synthetic"; it's a real problem, not having a
>> real-world, biggish corpus plus real queries, for better performance
>> testing...
>>
>> Mike McCandless
>>
>> http://blog.mikemccandless.com
>>
>>
>> On Mon, Jan 20, 2014 at 11:22 PM, Scott Schneider
>> <Scott_Schneider@symantec.com> wrote:
>> > Hello,
>> >
>> > Would you folks mind giving me a few tips on performance testing
>> > Lucene?  I want to test the performance impact of a Directory subclass.
>> >
>> > What is a good testing tool to use?  I don't see a great way to get
>> > SolrMeter to run the max # updates/minute and measure throughput that
>> > way.  When I set the # updates/minute to a large #, SolrMeter logs
>> > NullPointerExceptions.  (I assume these are within SolrMeter, as I
>> > don't see errors in Solr.)  Mike McCandless's nightly Lucene
>> > performance tests look good, though I've only just started looking at
>> > them.
>> >
>> > Are there any particularly standard or good test sets?  I'd like to
>> > test 3 scenarios:  indexing only, querying only, and indexing plus
>> > querying.  McCandless's indexing test uses Wikipedia, which seems
>> > great, but he has a slew of tests that are each specific to some
>> > querying feature.  I'd like a single, general query test.  It's not
>> > hard to come up with a decent set of queries, but I'd really like
>> > something representative of real-world queries.  If there is some
>> > standard set of commonly used queries, that would be ideal.
>> >
>> > Thanks!
>> >
>> > Scott
>> >
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-user-help@lucene.apache.org
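
The three scenarios named in the original question (indexing only, querying only, and indexing plus querying) could be timed with a small harness along these lines. This is a minimal Python sketch under stated assumptions, not code from luceneutil: `measure_throughput`, `run_scenarios`, `index_op`, and `query_op` are hypothetical names, and the two callables stand in for whatever code adds one document to, or runs one query against, an index backed by the Directory under test.

```python
import threading
import time


def measure_throughput(op, items):
    """Call op(item) for each item; return (count, seconds, ops_per_second)."""
    start = time.perf_counter()
    count = 0
    for item in items:
        op(item)
        count += 1
    elapsed = time.perf_counter() - start
    # Guard against a zero-length interval on very small workloads.
    rate = count / elapsed if elapsed > 0 else float("inf")
    return count, elapsed, rate


def run_scenarios(index_op, query_op, docs, queries):
    """Time indexing only, querying only, then both concurrently.

    index_op and query_op are hypothetical callables supplied by the caller.
    Note that docs are indexed twice overall: once in the indexing-only run
    and once more in the mixed run.
    """
    results = {
        "index_only": measure_throughput(index_op, docs),
        "query_only": measure_throughput(query_op, queries),
    }

    # Mixed scenario: index on a background thread while this thread queries.
    mixed = {}

    def index_worker():
        mixed["index"] = measure_throughput(index_op, docs)

    worker = threading.Thread(target=index_worker)
    worker.start()
    mixed["query"] = measure_throughput(query_op, queries)
    worker.join()

    results["mixed_index"] = mixed["index"]
    results["mixed_query"] = mixed["query"]
    return results


if __name__ == "__main__":
    # Stand-in workload: a dict guarded by a lock plays the role of the index.
    store = {}
    lock = threading.Lock()

    def index_op(doc):
        with lock:
            store[doc[0]] = doc[1]

    def query_op(key):
        with lock:
            return store.get(key)

    docs = [(i, "body %d" % i) for i in range(10000)]
    queries = list(range(10000))
    for name, (count, secs, rate) in run_scenarios(index_op, query_op, docs, queries).items():
        print("%-12s %6d ops in %.4fs (%.0f ops/s)" % (name, count, secs, rate))
```

For an A/B comparison of a new Directory against a baseline, the same harness would be run once per implementation with identical `docs` and `queries`, repeated several times so that run-to-run noise can be seen before the rates are compared.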