spark-dev mailing list archives

From rxin <...@git.apache.org>
Subject [GitHub] incubator-spark pull request: MLI-2: Start adding k-fold cross val...
Date Mon, 17 Feb 2014 19:39:15 GMT
Github user rxin commented on the pull request:

    https://github.com/apache/incubator-spark/pull/572#discussion_r9802309
  
    ```scala
    (1 to numFolds).map { fold =>
      val sampler = new BernoulliSampler[T]((fold - 1) / foldsF, fold / foldsF, complement = false)
      val train = new PartitionwiseSampledRDD(rdd, sampler, seed)
      val test = new PartitionwiseSampledRDD(rdd, sampler, seed.complement)  // might need to create this
      (train, test)
    }
    ```
    
    Make sure you rename folds to numFolds.
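
To illustrate the range-based idea in the snippet above without depending on Spark, here is a minimal sketch on a plain `Seq`. It assumes each element gets one uniform draw in [0, 1) from a generator that is re-seeded for every fold, so the draws are identical across folds; an element belongs to fold `k` when its draw lies in `[(k - 1) / numFolds, k / numFolds)`. `KFoldSketch` and its `kFold` helper are hypothetical names for this illustration and are not part of Spark or of this pull request.

```scala
import scala.util.Random

object KFoldSketch {
  // Hypothetical helper, not Spark API: for each fold, returns
  // (elements outside the fold's range, elements inside the fold's range).
  def kFold[T](data: Seq[T], numFolds: Int, seed: Long): Seq[(Seq[T], Seq[T])] = {
    val numFoldsF = numFolds.toDouble
    (1 to numFolds).map { fold =>
      val lb = (fold - 1) / numFoldsF
      val ub = fold / numFoldsF
      // Re-seeding per fold replays the same draw for each element,
      // so every fold sees identical values and the ranges partition the data.
      val rng = new Random(seed)
      val tagged = data.map(item => (item, rng.nextDouble()))
      val inRange = tagged.collect { case (item, x) if x >= lb && x < ub => item }
      val outOfRange = tagged.collect { case (item, x) if !(x >= lb && x < ub) => item }
      (outOfRange, inRange)
    }
  }

  def main(args: Array[String]): Unit = {
    val data = (1 to 20).toSeq
    val folds = kFold(data, numFolds = 4, seed = 42L)
    folds.foreach { case (rest, held) =>
      // Each split covers the whole input with no overlap.
      assert(rest.size + held.size == data.size)
      assert(rest.intersect(held).isEmpty)
    }
    // Across all folds, every element is held out exactly once.
    assert(folds.flatMap(_._2).sorted == data)
    println(s"held-out sizes per fold: ${folds.map(_._2.size)}")
  }
}
```

Because the same seed drives every fold, the in-range slice and its complement never overlap, which is the property a complementary sampler would need to preserve. Fold sizes are only approximately equal, since membership is decided by random draws.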


