commons-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Eric Barnhill <ericbarnh...@gmail.com>
Subject Re: [MATH] MATH-1378: KMeansPlusPlusClusterer optimize seeding procedure.
Date Thu, 23 Jun 2016 13:37:24 GMT
I use kmeans a bit and I will look at it.

On Thu, Jun 23, 2016 at 2:10 PM, Artem Barger <artem@bargr.net> wrote:

> Hi all,
>
> While I understand there is a project decision threads are going on ML,
> however I'd like to suggest and provide some improvements of CM kmeans++
> implementation in the seeding procedure. Currently sum of squared distances
> computed each iteration during initial centers seeding, which is redundant
> since sum can be computed once and updated within the cycle.
>
>
> Subjected JIRA item explains the optimization and I've also provided patch
> with suggested fix. Would be glad to hear any comments or reviews.
>
>
> Best regards,
>                       Artem Barger.
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message