crunch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Leen Toelen <>
Subject performance impact of batching emit(...)
Date Thu, 09 Jan 2014 21:30:11 GMT

when looking at PreDistinct I notice that calls to emitter.emit(...) are
stored in memory until more than 'flushEvery' records are found. How does
this batching impact performance, since the calls to emit(...) are not
batched in the cleanup method but called in a loop?

Is there an easy way to find the best size for 'flushEvery' other than try
and error?

Best regards,

View raw message