flink-user mailing list archives

From Fabian Hueske <fhue...@gmail.com>
Subject Re: Multi-dimensional[more than 2] input for KMeans Clustering in Apache flink
Date Tue, 01 Mar 2016 16:22:57 GMT
Hi Subash,

the KMeans implementation in Flink is meant to be a simple toy example and
should not be used for serious analysis tasks.
It shows how the DataSet API works by implementing a well-known algorithm.

Nonetheless, the example can be easily extended to work for three or more
dimensions.
You would need to adapt the methods to compute the distance and the
location of the new center.
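
As a rough sketch of what that adaptation could look like: the class below
stores the coordinates in an array instead of two fixed fields, so the
distance and the center update work for any number of dimensions. The name
PointN and its methods are hypothetical and are not part of the Flink
example code. It is plain Java and assumes that every point has the same
number of dimensions.

```java
import java.io.Serializable;

/** Hypothetical n-dimensional point; not a class shipped with Flink. */
class PointN implements Serializable {

    // One entry per dimension.
    public double[] coords;

    // No-argument constructor, assumed to be required if the class
    // is used as a POJO type in a Flink job.
    public PointN() {}

    public PointN(double[] coords) {
        this.coords = coords.clone();
    }

    /** Adds the other point coordinate-wise (sums the points of a cluster). */
    public PointN add(PointN other) {
        checkSameDimension(other);
        for (int i = 0; i < coords.length; i++) {
            coords[i] += other.coords[i];
        }
        return this;
    }

    /** Divides every coordinate by count, turning a sum into a mean. */
    public PointN div(long count) {
        for (int i = 0; i < coords.length; i++) {
            coords[i] /= count;
        }
        return this;
    }

    /** Euclidean distance over all dimensions. */
    public double euclideanDistance(PointN other) {
        checkSameDimension(other);
        double sum = 0.0;
        for (int i = 0; i < coords.length; i++) {
            double diff = coords[i] - other.coords[i];
            sum += diff * diff;
        }
        return Math.sqrt(sum);
    }

    private void checkSameDimension(PointN other) {
        if (coords.length != other.coords.length) {
            throw new IllegalArgumentException("Points must have the same dimension");
        }
    }
}
```

The new center of a cluster would then be the coordinate-wise sum of its
points (add) divided by the number of points (div), and each point would be
assigned to the center with the smallest euclideanDistance.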

Best, Fabian

2016-03-01 17:07 GMT+01:00 subash basnet <yasubash@gmail.com>:

> Hello all,
>
> Currently I find that only two-dimensional input is possible for the
> KMeans clustering in Flink.
>
> Is there an existing implementation, or what would be the approach to
> implement more than two-dimensional input for KMeans in Flink?
> Or is there any other clustering method implemented in Flink that takes
> more than two-dimensional data as input?
>
>
> Best Regards,
> Subash Basnet
>
