spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Apache Spark (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-22422) Add Adjusted R2 to RegressionMetrics
Date Thu, 02 Nov 2017 05:17:00 GMT

    [ https://issues.apache.org/jira/browse/SPARK-22422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16235195#comment-16235195
] 

Apache Spark commented on SPARK-22422:
--------------------------------------

User 'tengpeng' has created a pull request for this issue:
https://github.com/apache/spark/pull/19638

> Add Adjusted R2 to RegressionMetrics
> ------------------------------------
>
>                 Key: SPARK-22422
>                 URL: https://issues.apache.org/jira/browse/SPARK-22422
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>    Affects Versions: 2.2.0
>            Reporter: Teng Peng
>            Priority: Minor
>
> In practice, no one looks at R2 alone. The reason is R2 itself is misleading. If we add
more parameters, R2 will not decrease but only increase (or stay the same). This leads to
overfitting.
> I added adjusted R2 as the metric which was implemented in all major statistical analysis
tools.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message