lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ian Boston <...@tfd.co.uk>
Subject A Faster ISOLatin1AccentFilter
Date Sun, 29 Apr 2007 09:55:08 GMT
Hi,

We've been using this filter in a project and found it a bit slow, so 
we've re-written it. In tests we did on a 5M string with no accents, we 
found the version in trunk to take ~200ms, this version takes about 
~12ms. If there are accents, then its not quite as good at about 60ms. 
(MacBookPro)

Code is at
https://saffron.caret.cam.ac.uk/svn/projects/darwincorresp/trunk/darwin-analyzer/src/main/java/uk/ac/cam/caret/darwin/lucene/ISOLatin1AccentFilter.java

BTW, just replacing StringBuffer with StringBuilder got down to about 
80ms for both cases.

Ian

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message