Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 5C46E200D26 for ; Fri, 20 Oct 2017 18:05:09 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 5AF95160BEE; Fri, 20 Oct 2017 16:05:09 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id AB41D160BCB for ; Fri, 20 Oct 2017 18:05:08 +0200 (CEST) Received: (qmail 35418 invoked by uid 500); 20 Oct 2017 16:05:07 -0000 Mailing-List: contact issues-help@commons.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: issues@commons.apache.org Delivered-To: mailing list issues@commons.apache.org Received: (qmail 35395 invoked by uid 99); 20 Oct 2017 16:05:07 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 20 Oct 2017 16:05:07 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 7D37D180725 for ; Fri, 20 Oct 2017 16:05:06 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -99.202 X-Spam-Level: X-Spam-Status: No, score=-99.202 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id o2ID0UOnqHzU for ; Fri, 20 Oct 2017 16:05:05 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id BEFDA5FE64 for ; Fri, 20 Oct 2017 16:05:04 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id A8AD9E2576 for ; Fri, 20 Oct 2017 16:05:02 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id BFDD6243A7 for ; Fri, 20 Oct 2017 16:05:00 +0000 (UTC) Date: Fri, 20 Oct 2017 16:05:00 +0000 (UTC) From: "Pascal Schumacher (JIRA)" To: issues@commons.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (TEXT-103) Add provision to change the cost for insert, delete and replace operation in levenshtein distance MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Fri, 20 Oct 2017 16:05:09 -0000 [ https://issues.apache.org/jira/browse/TEXT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16212824#comment-16212824 ] Pascal Schumacher commented on TEXT-103: ---------------------------------------- [~arohit] Consider this issue assigned to you anyway. Looking forward to the pull request/patch. Thanks in advance! (Sadly it is not possible to assign issues to people who are not part of the commons developer group in jira, so it has to stay unassigned in jira.) > Add provision to change the cost for insert, delete and replace operation in levenshtein distance > ------------------------------------------------------------------------------------------------- > > Key: TEXT-103 > URL: https://issues.apache.org/jira/browse/TEXT-103 > Project: Commons Text > Issue Type: Improvement > Reporter: Rohit Agarwal > Priority: Minor > Labels: newbie, patch > Original Estimate: 48h > Remaining Estimate: 48h > > There are two implementation of levenshtein distance, unlimitedCompare and limitedCompare. > I propose to generalise the levenshtein distance by adding an option to change the value of > 1) Addition of Character. > 2) Deletion of Character. > 3) Substitution of Character. > Currently they are all set to 1. For backward compatibility this will be the default case. -- This message was sent by Atlassian JIRA (v6.4.14#64029)