spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Joseph K. Bradley (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (SPARK-16920) Investigate and fix issues introduced in SPARK-15858
Date Thu, 23 Feb 2017 23:22:44 GMT

     [ https://issues.apache.org/jira/browse/SPARK-16920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Joseph K. Bradley reassigned SPARK-16920:
-----------------------------------------

    Assignee: Mahmoud Rawas

> Investigate and fix issues introduced in SPARK-15858
> ----------------------------------------------------
>
>                 Key: SPARK-16920
>                 URL: https://issues.apache.org/jira/browse/SPARK-16920
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>            Reporter: Vladimir Feinberg
>            Assignee: Mahmoud Rawas
>             Fix For: 2.2.0
>
>
> There were several issues regarding the PR resolving SPARK-15858, my comments are available
here:
> https://github.com/apache/spark/commit/393db655c3c43155305fbba1b2f8c48a95f18d93
> The two most important issues are:
> 1. The PR did not add a stress test proving it resolved the issue it was supposed to
(though I have no doubt the optimization made is indeed correct).
> 2. The PR introduced quadratic prediction time in terms of the number of trees, which
was previously linear. This issue needs to be investigated for whether it causes problems
for large numbers of trees (say, 1000), an appropriate test should be added, and then fixed.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message