spark-issues mailing list archives

From "Marco Gaido (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SPARK-23568) Silhouette should get number of features from metadata if available
Date Fri, 02 Mar 2018 17:09:00 GMT
Marco Gaido created SPARK-23568:
-----------------------------------

             Summary: Silhouette should get number of features from metadata if available
                 Key: SPARK-23568
                 URL: https://issues.apache.org/jira/browse/SPARK-23568
             Project: Spark
          Issue Type: Improvement
          Components: ML
    Affects Versions: 2.4.0
            Reporter: Marco Gaido


The Silhouette computation needs to know the number of features. Currently this is obtained
by taking the first row and checking the size of its features vector. Although this works
fine, when the number of attributes is present in the metadata of the column we can read it
from there and avoid the additional job that is generated by using `first`.
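
A minimal sketch of the proposed lookup order, written in plain Python so it runs without a cluster. It is not the actual Spark change. The function name `num_features` and the `count_from_first_row` callback are hypothetical, and the `ml_attr` / `num_attrs` keys are assumed to match the layout Spark ML uses for vector column metadata (in PySpark this dict would come from something like `df.schema[col].metadata`).

```python
from typing import Any, Callable, Mapping


def num_features(
    metadata: Mapping[str, Any],
    count_from_first_row: Callable[[], int],
) -> int:
    """Return the feature count, preferring column metadata over a data scan.

    `metadata` is assumed to be shaped like Spark ML column metadata,
    e.g. {"ml_attr": {"num_attrs": 5}}. `count_from_first_row` is a
    hypothetical stand-in for the expensive path (running `first` and
    measuring the vector), and is only invoked when metadata is missing.
    """
    ml_attr = metadata.get("ml_attr") or {}
    n = ml_attr.get("num_attrs")
    if isinstance(n, int) and n >= 0:
        # Metadata knows the size: no extra job is needed.
        return n
    # Size unknown: fall back to inspecting the first row.
    return count_from_first_row()
```

Passing the fallback as a callable keeps it lazy, so the cost of `first` is only paid when the metadata does not carry the size.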



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org

