Date: Wed, 28 Dec 2011 16:58:31 +0000 (UTC)
From: "Robert Muir (Resolved) (JIRA)"
To: dev@lucene.apache.org
Reply-To: dev@lucene.apache.org
Message-ID: <173013854.48778.1325091511189.JavaMail.tomcat@hel.zones.apache.org>
In-Reply-To: <1518386263.33422.1324427970832.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] [Resolved] (SOLR-2982) Upgrade Apache Commons Codec to version 1.6 in order to add new Beider-Morse Phonetic Matching (BMPM) option

     [ https://issues.apache.org/jira/browse/SOLR-2982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Muir resolved SOLR-2982.
-------------------------------

    Resolution: Fixed

> Upgrade Apache Commons Codec to version 1.6 in order to add new Beider-Morse Phonetic Matching (BMPM) option
> ------------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-2982
>                 URL: https://issues.apache.org/jira/browse/SOLR-2982
>             Project: Solr
>          Issue Type: Improvement
>          Components: Rules, Schema and Analysis, search
>            Reporter: Brooke Schreier Ganz
>              Labels: codec, commons, commons-codec, language, names, phonetic, search, searching, soundalike
>             Fix For: 3.6, 4.0
>
>         Attachments: SOLR-2982.patch
>
>
> Apache Commons Codec released version 1.6 of their codec pack in November, 2011. Along with a few bug fixes, 1.6 contains a great new phonetic matching system called Beider-Morse Phonetic Matching (BMPM) that is far superior to the existing phonetic codecs, such as regular soundex, metaphone, caverphone, and so on. BMPM has actually been available for some time, but this is the first port of it to java, and its first commit in the Apache ecosystem.
> For a lot more information, see here: http://stevemorse.org/phoneticinfo.htm and http://stevemorse.org/phonetics/bmpm.htm
> BMPM would be a fantastic "soundalike" tool to help search for personal names (or just surnames) in a Solr/Lucene index, much better than Levenshtein distance for this use case.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org