spark-issues mailing list archives

From "Apache Spark (JIRA)" <>
Subject [jira] [Commented] (SPARK-22119) Add cosine distance to KMeans
Date Mon, 25 Sep 2017 16:37:01 GMT


Apache Spark commented on SPARK-22119:

User 'mgaido91' has created a pull request for this issue:

> Add cosine distance to KMeans
> -----------------------------
>                 Key: SPARK-22119
>                 URL:
>             Project: Spark
>          Issue Type: New Feature
>          Components: ML, MLlib
>    Affects Versions: 2.2.0
>            Reporter: Marco Gaido
>            Priority: Minor
> Currently, KMeans assumes that Euclidean distance is the only possible distance measure.
> In some use cases, e.g. text mining, other distance measures such as cosine distance
> are widely used. For such use cases, it would be good to support multiple distance measures.
> This ticket is to support the cosine distance measure in KMeans. Later, other algorithms
> can be extended to support several distance measures, and other distance measures can be added.
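
To illustrate why the choice of distance measure matters for cluster assignment, here is a minimal sketch in plain Python. It is not Spark's implementation: the function names, the sample centers, and the sample point are all hypothetical, and cosine distance is assumed to mean 1 minus the cosine similarity.

```python
import math


def euclidean_distance(a, b):
    # Straight-line distance; sensitive to vector magnitude.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine_distance(a, b):
    # Assumed definition: 1 - cosine similarity; depends only on direction.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine distance is undefined for zero vectors")
    return 1.0 - dot / (norm_a * norm_b)


def nearest_center(point, centers, distance):
    # Index of the center closest to `point` under the given distance function.
    return min(range(len(centers)), key=lambda i: distance(point, centers[i]))


# Hypothetical data: a long vector pointing roughly along the direction of (1, 1).
centers = [[1.0, 1.0], [10.0, 0.0]]
point = [8.0, 6.0]

by_euclidean = nearest_center(point, centers, euclidean_distance)
by_cosine = nearest_center(point, centers, cosine_distance)
print(by_euclidean, by_cosine)
```

With these sample values the two measures assign the point to different centers: Euclidean distance favors the center with similar magnitude, while cosine distance favors the center with similar direction. This is the behavior that matters in text mining, where document length should not dominate similarity.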

This message was sent by Atlassian JIRA

