lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Robert Muir <rcm...@gmail.com>
Subject Re: Is indexing much slower in 3.5.0 than in 2.4.1 for Wikipedia data?
Date Wed, 14 Dec 2011 13:31:03 GMT
On Tue, Dec 13, 2011 at 4:45 PM, Sean Tong <stong@jamasoftware.com> wrote:
> Hi,
>
> I modified the DocMaker in 3.5 to make it index the same 4 fields as 2.4.1 does. Now
I got  very similar stats in the index by checking Luke. The index performance was slightly
better than that by indexing 7 fields but still not comparable with the 2.4.1 performance:
>
> [java] ------------> Report sum by Prefix (MAddDocs) and Round (3 about 3 out of 14)
>     [java] Operation       round flush mrg   runCnt   recsPerRun        rec/s
 elapsedSec    avgUsedMem    avgTotalMem
>     [java] MAddDocs_200000     0 16.00  10        1       200000      
767.18      260.70   113,206,984    144,637,952
>     [java] MAddDocs_200000 -   1 16.00  10 -  -   1 -  -  200000 -  -  801.61
-  - 249.50 - 117,778,992 -  144,637,952
>     [java] MAddDocs_200000     2 16.00  10        1       200000      
734.39      272.33   121,479,568    126,287,872
>
> Maybe there are some other settings that make the benchmarks not comparable.

I think for benchmarking, this CloseIndex task should default to
doWait=false... maybe try passing that parameter to it.

-- 
lucidimagination.com

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message