flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ufuk Celebi <...@apache.org>
Subject Re: Custom keyBy(), look for similaties
Date Mon, 06 Jun 2016 14:12:37 GMT
Hey Iñaki,

you can use the KeySelector as described here:
https://ci.apache.org/projects/flink/flink-docs-release-1.0/apis/common/index.html#specifying-keys

But you only a local view for the current element, e.g. the library
you use to determine the similarity has to know the similarities
upfront.

– Ufuk


On Mon, Jun 6, 2016 at 9:31 AM, iñaki williams <juanramallo80@gmail.com> wrote:
> Hi guys,
>
> I am using Flink on my project and I have a question. (I am using Java)
>
> Is it possible to modify the keyby method in order to key by similarities
> and not by the exact name?
>
> Example: I recieve 2 DataStreams, in the first one , the name of the field
> that I want to KeyBy is "John Locke", while in the Datastream the field
> value is "John L". Can I use some java library to find for similarities
> between strings and if the similitude is high, then key those elements
> together.

Mime
View raw message