spark-issues mailing list archives

From "Sean Owen (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-14153) My dataset does not provide proper predictions in ALS
Date Tue, 29 Mar 2016 14:17:25 GMT

    [ https://issues.apache.org/jira/browse/SPARK-14153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216058#comment-15216058 ]

Sean Owen commented on SPARK-14153:
-----------------------------------

Please put your code inline but use {{ {code} }} to format it.
Yeah, that's my point. In the previous version, you didn't change your test set, it seems.

> My dataset does not provide proper predictions in ALS
> -----------------------------------------------------
>
>                 Key: SPARK-14153
>                 URL: https://issues.apache.org/jira/browse/SPARK-14153
>             Project: Spark
>          Issue Type: Question
>          Components: Java API, ML
>            Reporter: Dulaj Rajitha
>
> When I use the data set from the GitHub example, I get proper predictions. But when I use my own data set, it does not predict well (it has a large RMSE).
> I used CrossValidator for ALS (in Spark ML), and here are the best model parameters.
> 16/03/25 12:03:06 INFO CrossValidator: Average cross-validation metrics: WrappedArray(NaN, NaN, NaN, NaN, NaN, NaN, NaN, NaN)
> 16/03/25 12:03:06 INFO CrossValidator: Best set of parameters:
> {
> 	als_c911c0e183a3-alpha: 0.02,
> 	als_c911c0e183a3-rank: 500,
> 	als_c911c0e183a3-regParam: 0.03
> }
> But when I use the movie data set, it gives proper values for the parameters, as below:
> 16/03/24 14:07:07 INFO CrossValidator: Average cross-validation metrics: WrappedArray(1.9481584447713676, 2.0501457159728944, 2.0600857505406935, 1.9457234533860048, 2.0494498583414282, 2.0595306613827002, 1.9488322049918922, 2.0489573853226797, 2.0584252131752, 1.9464006741621391, 2.048241271354197, 2.057853990227443)
> 16/03/24 14:07:07 INFO CrossValidator: Best set of parameters:
> {
> 	als_31a605e7717b-alpha: 0.02,
> 	als_31a605e7717b-rank: 1,
> 	als_31a605e7717b-regParam: 0.02
> }
> 16/03/24 14:07:07 INFO CrossValidator: Best cross-validation metric: 1.9457234533860048.
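
A note on the all-NaN metrics in the first log: an RMSE becomes NaN as soon as a single prediction in the evaluated fold is NaN. Spark's ALS model can produce NaN predictions for users or items that did not appear in the training split. Whether that is what happened with this data set is an assumption and is not confirmed in this thread. The sketch below uses plain Python rather than Spark, and the helper names `rmse` and `rmse_ignoring_nan` are hypothetical, chosen only for illustration. It shows how one NaN prediction poisons the metric and how dropping such rows before scoring restores a finite value.

```python
import math


def rmse(pairs):
    """Root-mean-square error over (label, prediction) pairs."""
    pairs = list(pairs)
    if not pairs:
        return float("nan")
    squared_error = sum((label - pred) ** 2 for label, pred in pairs)
    return math.sqrt(squared_error / len(pairs))


def rmse_ignoring_nan(pairs):
    """RMSE computed only over rows whose prediction is not NaN."""
    kept = [(label, pred) for label, pred in pairs if not math.isnan(pred)]
    return rmse(kept)


# Made-up (label, prediction) rows; the last one mimics a user or item
# that was absent from the training split (assumption for illustration).
scored = [(4.0, 3.5), (2.0, 2.5), (5.0, float("nan"))]

print(rmse(scored))               # NaN: one bad row poisons the mean
print(rmse_ignoring_nan(scored))  # finite once the NaN row is dropped
```

If this is the cause, every fold of the cross-validation would report NaN, and the "best" parameters chosen from such metrics would be arbitrary.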



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org

