Message-ID: <42B04828.80602@apache.org>
Date: Wed, 15 Jun 2005 17:24:24 +0200
From: Torsten Curdt
Reply-To: dev@cocoon.apache.org
To: dev@cocoon.apache.org
Subject: Re: [OT] Determining the similarity between a pair of texts
References: <7e895c1c3b1f9f27c2c4ced5f2086f89@apache.org>
MIME-Version: 1.0
Content-Type: multipart/signed; micalg=pgp-sha1;
 protocol="application/pgp-signature";
 boundary="------------enig40659428C264F3CED5EE1538"

This is an OpenPGP/MIME signed message (RFC 2440 and 3156)
--------------enig40659428C264F3CED5EE1538
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit

>>Excuse the Off-Topic, but I'm looking for a Java API for determining
>>the degree of similarity (based on word frequency or whatever) between
>>two text strings.

also commons codec has some algorithms
...depends on what you are after exactly

http://jakarta.apache.org/commons/codec/

cheers
--
Torsten

--------------enig40659428C264F3CED5EE1538
Content-Type: application/pgp-signature; name="signature.asc"
Content-Description: OpenPGP digital signature
Content-Disposition: attachment; filename="signature.asc"

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (Darwin)

iD8DBQFCsEgoBGM6V3wgCUERAj8YAJ45PwwycZE81mRTdhWULGKcJZChBwCeODb4
Vx65jMIam675rPkKQQz+Uno=
=BeEQ
-----END PGP SIGNATURE-----

--------------enig40659428C264F3CED5EE1538--
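
For the "based on word frequency" idea in the quoted question, a minimal sketch in plain Java might look like the following. It is not Commons Codec code (Commons Codec mainly offers phonetic encoders such as Soundex and Metaphone); it is just one common way to compare word counts, namely cosine similarity. The names TextSimilarity, wordCounts, norm and cosine are made up for illustration, and the tokenization rule (split on anything that is not a letter or digit) is an assumption.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper, not part of Commons Codec or any other library.
class TextSimilarity {

    // Count how often each lower-cased word occurs in the text.
    // Assumption: a "word" is any run of letters or digits.
    static Map<String, Integer> wordCounts(String text) {
        Map<String, Integer> counts = new HashMap<>();
        for (String word : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
            if (!word.isEmpty()) {
                counts.merge(word, 1, Integer::sum);
            }
        }
        return counts;
    }

    // Euclidean length of a word-count vector.
    static double norm(Map<String, Integer> counts) {
        double sum = 0.0;
        for (int c : counts.values()) {
            sum += (double) c * c;
        }
        return Math.sqrt(sum);
    }

    // Cosine similarity of the two word-count vectors:
    // 1.0 for identical word frequencies, 0.0 when no word is shared
    // or when either text contains no words at all.
    static double cosine(String a, String b) {
        Map<String, Integer> ca = wordCounts(a);
        Map<String, Integer> cb = wordCounts(b);
        double na = norm(ca);
        double nb = norm(cb);
        if (na == 0.0 || nb == 0.0) {
            return 0.0;
        }
        double dot = 0.0;
        for (Map.Entry<String, Integer> e : ca.entrySet()) {
            Integer other = cb.get(e.getKey());
            if (other != null) {
                dot += (double) e.getValue() * other;
            }
        }
        // Clamp to guard against floating-point rounding just above 1.0.
        return Math.min(1.0, dot / (na * nb));
    }

    public static void main(String[] args) {
        // Two sentences differing in one word out of six; roughly 0.875.
        System.out.println(cosine("the cat sat on the mat", "the cat lay on the mat"));
    }
}
```

Whether this is good enough depends on what you are after exactly: it ignores word order and spelling variants, which is where phonetic or edit-distance algorithms would come in.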