lucene-solr-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Lance Norskog <>
Subject RemoveDuplicatesTokenFilter redundancy problem?
Date Wed, 23 Dec 2009 00:47:21 GMT
It looks like the inner loop of
org.apache.solr.analysis.RemoveDuplicatesTokenFilter could use a
'break'. I don't remember enough Big-O analysis to give the
difference, but they will be two different formulae.

For people doing large documents (I've heard gigabytes for email
forensics) this would matter...

Lance Norskog

View raw message