datafu-dev mailing list archives

From "Eyal Allweil (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DATAFU-87) Edit distance
Date Mon, 09 Oct 2017 15:05:01 GMT

    [ https://issues.apache.org/jira/browse/DATAFU-87?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16197120#comment-16197120 ]

Eyal Allweil commented on DATAFU-87:
------------------------------------

On second thought, since this UDF is now available in Hive, and since Levenshtein distance
is a purely local computation, I'm guessing there's no need for a specific DataFu implementation.
Shall we close this issue?

Here are some links to the Hive UDF.

https://issues.apache.org/jira/browse/HIVE-9556

https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF#LanguageManualUDF-StringFunctions
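
For context, the "purely local" point above can be illustrated with a minimal sketch in plain Java. This is not the code from DATAFU-87.patch or from the Hive UDF; the class name EditDistanceSketch and the method name levenshtein are hypothetical, chosen only for this example. It assumes the textbook dynamic-programming formulation (unit cost for insert, delete and substitute) and compares UTF-16 code units, so characters outside the Basic Multilingual Plane count as two units.

```java
// Illustrative sketch only: not the DATAFU-87 patch and not Hive's implementation.
// Class and method names are hypothetical.
class EditDistanceSketch {

    /**
     * Levenshtein distance between a and b, computed with two rolling rows
     * of the dynamic-programming table. Only the two input strings are needed,
     * so the function can be applied independently to each record.
     */
    static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];

        // Distance from the empty prefix of a to each prefix of b: j insertions.
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }

        for (int i = 1; i <= a.length(); i++) {
            // Distance from a's first i characters to the empty string: i deletions.
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = (a.charAt(i - 1) == b.charAt(j - 1)) ? 0 : 1;
                int insertion = curr[j - 1] + 1;
                int deletion = prev[j] + 1;
                int substitution = prev[j - 1] + cost;
                curr[j] = Math.min(Math.min(insertion, deletion), substitution);
            }
            // Reuse the arrays: the row just filled becomes the previous row.
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }
}
```

With this sketch, EditDistanceSketch.levenshtein("kitten", "sitting") evaluates to 3. Because the result depends only on the two arguments, no grouping, joining or other cross-record state is involved.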



> Edit distance
> -------------
>
>                 Key: DATAFU-87
>                 URL: https://issues.apache.org/jira/browse/DATAFU-87
>             Project: DataFu
>          Issue Type: New Feature
>    Affects Versions: 1.3.0
>            Reporter: Joydeep Banerjee
>         Attachments: DATAFU-87.patch
>
>
> [This is work-in-progress]
> Given two strings, provide a measure of dissimilarity (Levenshtein distance) between them.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
