lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jason Rutherglen (JIRA)" <j...@apache.org>
Subject [jira] Updated: (LUCENE-1577) Benchmark of different in RAM realtime techniques
Date Fri, 27 Mar 2009 20:04:50 GMT

     [ https://issues.apache.org/jira/browse/LUCENE-1577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jason Rutherglen updated LUCENE-1577:
-------------------------------------

    Attachment: LUCENE-1577.patch

This patch performs a benchmark of 3 different techniques for RAM based realtime indexing
where after an update, the new document is searchable.  It performs multiple rounds of indexing
and calculates the percentage difference with fastest of each of the 3 techniques.  The document
source is the Wikipedia English XML used by contrib/benchmark.

* RealtimeWriter uses InstantiatedIndex
* LuceneWriter adds documents to an IndexWriter
* LuceneRealtimeWriter creates a RAMDirectory, opens an IndexWriter, adds a document, then
closes the writer.

I found it odd that RealtimeWriter is faster than LuceneWriter and so perhaps the benchmark
is incorrect somehow.  Otherwise the results look highly promising in that we can implement
realtime search with no impact to existing indexing performance.  

Summary of the results:

numRounds:3 docs indexed:50000
lowest of each, percent compared with lowest
RealtimeWriter:7597 dif:0% 
LuceneWriter:12940 dif:70%
LuceneRealtimeWriter:25882 dif:241%


> Benchmark of different in RAM realtime techniques
> -------------------------------------------------
>
>                 Key: LUCENE-1577
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1577
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/*
>    Affects Versions: 2.4.1
>            Reporter: Jason Rutherglen
>            Priority: Minor
>             Fix For: 2.9
>
>         Attachments: LUCENE-1577.patch
>
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> A place to post code that benchmarks the differences in the speed of indexing and searching
using different realtime techniques.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message