commons-dev mailing list archives

From bugzi...@apache.org
Subject DO NOT REPLY [Bug 38911] New: - Commons Lang StringUtils#getLevenshteinDistance() performance is sub-optimal
Date Thu, 09 Mar 2006 18:20:46 GMT
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=38911>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=38911

           Summary: Commons Lang StringUtils#getLevenshteinDistance()
                    performance is sub-optimal
           Product: Commons
           Version: unspecified
          Platform: Other
        OS/Version: other
            Status: NEW
          Severity: normal
          Priority: P2
         Component: Lang
        AssignedTo: commons-dev@jakarta.apache.org
        ReportedBy: cedrik.lime@gmail.com


The implementation of Commons Lang StringUtils#getLevenshteinDistance(String,
String) is based on work from <http://www.merriampark.com/ld.htm>. While this
implementation works, it is *very* memory hungry and can thus slow down heavy
computations (the GC has much more to collect in memory-constrained
environments). The current implementation needs x*y bytes of memory, where x
and y are the lengths of the two strings.

An improved implementation can be found at
<http://www.merriampark.com/ldjava.htm>. In my own internal benchmarks it ran
up to 3 times faster in low-memory situations. This new implementation needs
x+y bytes of memory.
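
To illustrate where the memory saving can come from, here is a minimal sketch
of a Levenshtein distance that keeps only two rows of the cost matrix instead
of the full x*y table. It is not the code published at the URL above, and the
class and method names (LevenshteinSketch, distance) are hypothetical; it only
shows the general two-row technique, assuming a unit cost for insertion,
deletion and substitution.

```java
// Hypothetical illustration, not the Commons Lang or merriampark.com code.
final class LevenshteinSketch {

    private LevenshteinSketch() {
    }

    // Edit distance between s and t using two int arrays of length
    // s.length() + 1 rather than a full (n+1) x (m+1) matrix.
    static int distance(String s, String t) {
        if (s == null || t == null) {
            throw new IllegalArgumentException("Strings must not be null");
        }
        int n = s.length();
        int m = t.length();
        if (n == 0) {
            return m;
        }
        if (m == 0) {
            return n;
        }

        int[] prev = new int[n + 1]; // costs for the previous character of t
        int[] curr = new int[n + 1]; // costs for the current character of t

        // Distance from each prefix of s to the empty string.
        for (int i = 0; i <= n; i++) {
            prev[i] = i;
        }

        for (int j = 1; j <= m; j++) {
            char tj = t.charAt(j - 1);
            curr[0] = j; // distance from the empty string to t[0..j)

            for (int i = 1; i <= n; i++) {
                int cost = (s.charAt(i - 1) == tj) ? 0 : 1;
                int viaNeighbours = Math.min(curr[i - 1] + 1, prev[i] + 1);
                curr[i] = Math.min(viaNeighbours, prev[i - 1] + cost);
            }

            // Swap the rows so the finished row becomes "previous".
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }

        // After the final swap, prev holds the last computed row.
        return prev[n];
    }
}
```

Calling LevenshteinSketch.distance("kitten", "sitting") returns 3. Because the
two rows are allocated once and reused, the number of temporary objects handed
to the GC no longer grows with the product of the string lengths.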

Please change the getLevenshteinDistance() implementation to use the one at
<http://www.merriampark.com/ldjava.htm>.
