lucene-dev mailing list archives

From "Christian Moen (Commented) (JIRA)" <>
Subject [jira] [Commented] (SOLR-3282) Perform Kuromoji/Japanese stability test before 3.6 freeze
Date Tue, 10 Apr 2012 15:37:19 GMT


Christian Moen commented on SOLR-3282:

I'll resolve this issue now.

I've also been doing additional testing using the VisualVM Visual GC plugin, and I'm seeing
that the {{org.apache.lucene.analysis.ja.Token}} objects get collected largely as we expect.
In actual deployments, it's perhaps a good idea to use a larger eden space, either by using
the server GC defaults or by tuning the heap settings explicitly.
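
For anyone who wants to check how large the eden space actually is under a given set of JVM options, here is a minimal sketch using the standard java.lang.management API. The class name HeapPools is made up for illustration, and it reports on the JVM it runs in, so for Solr it would have to run inside the same JVM (or the same beans would have to be read over JMX, which is what VisualVM does). Pool names such as "PS Eden Space" depend on the collector in use.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.MemoryPoolMXBean;
import java.lang.management.MemoryType;
import java.lang.management.MemoryUsage;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper class, not part of Solr or Lucene.
public class HeapPools {

    /** Returns one line per heap memory pool (eden, survivor, old generation, ...). */
    static List<String> describeHeapPools() {
        List<String> lines = new ArrayList<String>();
        for (MemoryPoolMXBean pool : ManagementFactory.getMemoryPoolMXBeans()) {
            if (pool.getType() != MemoryType.HEAP) {
                continue; // skip non-heap pools such as the code cache
            }
            MemoryUsage usage = pool.getUsage();
            if (usage == null) {
                continue; // the pool is no longer valid
            }
            // getMax() returns -1 when no maximum is defined for the pool.
            lines.add(String.format("%s: used=%d committed=%d max=%d (bytes)",
                    pool.getName(), usage.getUsed(), usage.getCommitted(), usage.getMax()));
        }
        return lines;
    }

    public static void main(String[] args) {
        for (String line : describeHeapPools()) {
            System.out.println(line);
        }
    }
}
```

Comparing the eden line with and without the -server option (or with an explicit -Xmn value) shows what the young generation size ends up being.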

In longer-term tests, it seems that Solr's heap space also gets used up with a
512MB heap before a full GC recovers a lot of it.  I suspect this is caused by searching
with highlighting, since the heap seems very stable with indexing only.  (In either case, this
doesn't seem to be caused by Kuromoji.)
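
As a rough way to track how often collections happen over a long run without a GUI, the GC counters exposed by java.lang.management can be sampled periodically. This is only a sketch, with a made-up class name GcTotals; like any in-process check, it sees the JVM it runs in, and the collector names it prints vary with the GC in use.

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;

// Hypothetical helper class, not part of Solr or Lucene.
public class GcTotals {

    /** Returns {total collection count, total collection time in ms} over all collectors. */
    static long[] totals() {
        long count = 0;
        long timeMs = 0;
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            // Both getters return -1 when the value is undefined for a collector.
            count += Math.max(0L, gc.getCollectionCount());
            timeMs += Math.max(0L, gc.getCollectionTime());
        }
        return new long[] {count, timeMs};
    }

    public static void main(String[] args) {
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            System.out.println(gc.getName() + ": collections=" + gc.getCollectionCount()
                    + " timeMs=" + gc.getCollectionTime());
        }
        long[] t = totals();
        System.out.println("total collections=" + t[0] + " total timeMs=" + t[1]);
    }
}
```

Sampling these numbers at fixed intervals and comparing the old-generation collector's count between samples gives a simple signal for full GC activity.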

> Perform Kuromoji/Japanese stability test before 3.6 freeze
> ----------------------------------------------------------
>                 Key: SOLR-3282
>                 URL:
>             Project: Solr
>          Issue Type: Task
>          Components: Schema and Analysis
>    Affects Versions: 3.6, 4.0
>            Reporter: Christian Moen
>            Assignee: Christian Moen
>         Attachments: 250k-queries-no-highlight-gc.log, 250k-queries-no-highlight-visualvm.png,
62k-queries-highlight-gc.log, 62k-queries-highlight-visualvm.png, jawiki-index-gc.log, jawiki-index-gcviewer.png,
jawiki-index-visualvm.png, long-query-indexing-gc.log, long-search-indexing-visualvm.png
> Kuromoji might be used by many and also in mission critical systems.  I'd like to run
> a stability test before we freeze 3.6.
> My thinking is to test the out-of-the-box configuration using fieldtype {{text_ja}} as
> # Index all of Japanese Wikipedia documents (approx. 1.4M documents) in a never ending
> # Simultaneously run many tens of thousands of typical Japanese queries against the index
> at 3-5 queries per second with highlighting turned on
> While Solr is indexing and searching, I'd like to verify that:
> * Indexing and queries are working as expected
> * Memory and heap usage looks stable over time
> * Garbage collection is overall low over time -- no Full-GC issues
> I'll post findings and results to this JIRA.
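
The issue does not say which tool was used to drive the queries. Purely as an illustration, a fixed query rate like the 3-5 queries per second described above could be paced as in the sketch below. PacedRunner and runPaced are hypothetical names, and the task is an arbitrary Runnable standing in for whatever actually sends a query to Solr.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical load driver sketch, not the tool used in this issue.
public class PacedRunner {

    /**
     * Runs the task `total` times at roughly `perSecond` runs per second on a
     * single thread and returns how many runs finished without an exception.
     */
    static int runPaced(final Runnable task, final int total, double perSecond) {
        if (perSecond <= 0) {
            throw new IllegalArgumentException("perSecond must be positive");
        }
        if (total <= 0) {
            return 0;
        }
        long periodMicros = Math.max(1L, (long) (1000000 / perSecond));
        ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();
        final AtomicInteger started = new AtomicInteger();
        final AtomicInteger succeeded = new AtomicInteger();
        final CountDownLatch finished = new CountDownLatch(total);
        scheduler.scheduleAtFixedRate(new Runnable() {
            public void run() {
                if (started.incrementAndGet() > total) {
                    return; // enough runs done; ignore ticks until shutdown
                }
                try {
                    task.run();
                    succeeded.incrementAndGet();
                } catch (RuntimeException e) {
                    // A failed query should not stop the whole test run.
                } finally {
                    finished.countDown();
                }
            }
        }, 0, periodMicros, TimeUnit.MICROSECONDS);
        try {
            finished.await();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the interrupt for the caller
        } finally {
            scheduler.shutdownNow();
        }
        return succeeded.get();
    }

    public static void main(String[] args) {
        // Placeholder task; a real test would send a query to Solr here.
        int ok = runPaced(new Runnable() {
            public void run() {
                System.out.println("query placeholder");
            }
        }, 8, 4.0);
        System.out.println("completed: " + ok);
    }
}
```

Because a single thread is used, a slow query delays the following ones rather than overlapping with them, so the achieved rate can drop below the target when queries take longer than the period.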
