lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Christian Moen (Commented) (JIRA)" <>
Subject [jira] [Commented] (SOLR-3282) Perform Kuromoji/Japanese stability test before 3.6 freeze
Date Wed, 28 Mar 2012 02:45:22 GMT


Christian Moen commented on SOLR-3282:

h3. Summary

Without spending too much time interpreting details of this little test, I think Kuromoji
looks stable and ready for release.

I also think it's very nice that Solr 3.6 can index Japanese Wikipedia (~1.4 million docs)
continuously while serving unique user queries at 10 QPS on a laptop with using only 256MB
heap space.

Anyone interested, please feel to add your comments and interpretations of the the results.
> Perform Kuromoji/Japanese stability test before 3.6 freeze
> ----------------------------------------------------------
>                 Key: SOLR-3282
>                 URL:
>             Project: Solr
>          Issue Type: Task
>          Components: Schema and Analysis
>    Affects Versions: 3.6, 4.0
>            Reporter: Christian Moen
>            Assignee: Christian Moen
>         Attachments: 250k-queries-no-highlight-gc.log, 250k-queries-no-highlight-visualvm.png,
62k-queries-highlight-gc.log, 62k-queries-highlight-visualvm.png, jawiki-index-gc.log, jawiki-index-gcviewer.png,
jawiki-index-visualvm.png, long-query-indexing-gc.log, long-search-indexing-visualvm.png
> Kuromoji might be used by many and also in mission critical systems.  I'd like to run
a stability test before we freeze 3.6.
> My thinking is to test the out-of-the-box configuration using fieldtype {{text_ja}} as
> # Index all of Japanese Wikipedia documents (approx. 1.4M documents) in a never ending
> # Simultaneously run many tens of thousands typical Japanese queries against the index
at 3-5 queries per second with highlighting turned on
> While Solr is indexing and searching, I'd like to verify that:
> * Indexing and queries are working as expected
> * Memory and heap usage looks stable over time
> * Garbage collection is overall low over time -- no Full-GC issues
> I'll post findings and results to this JIRA.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message