Return-Path: X-Original-To: apmail-spark-commits-archive@minotaur.apache.org Delivered-To: apmail-spark-commits-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5E76011EA6 for ; Tue, 20 May 2014 04:29:54 +0000 (UTC) Received: (qmail 77565 invoked by uid 500); 20 May 2014 04:29:54 -0000 Delivered-To: apmail-spark-commits-archive@spark.apache.org Received: (qmail 77491 invoked by uid 500); 20 May 2014 04:29:54 -0000 Mailing-List: contact commits-help@spark.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@spark.apache.org Delivered-To: mailing list commits@spark.apache.org Received: (qmail 77379 invoked by uid 99); 20 May 2014 04:29:54 -0000 Received: from tyr.zones.apache.org (HELO tyr.zones.apache.org) (140.211.11.114) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 20 May 2014 04:29:54 +0000 Received: by tyr.zones.apache.org (Postfix, from userid 65534) id 0812B936DD5; Tue, 20 May 2014 04:29:54 +0000 (UTC) Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit From: tdas@apache.org To: commits@spark.apache.org Date: Tue, 20 May 2014 04:29:56 -0000 Message-Id: In-Reply-To: <5336021009b24e3798b70b7e4ca288d3@git.apache.org> References: <5336021009b24e3798b70b7e4ca288d3@git.apache.org> X-Mailer: ASF-Git Admin Mailer Subject: [4/4] git commit: [SPARK-1874][MLLIB] Clean up MLlib sample data [SPARK-1874][MLLIB] Clean up MLlib sample data 1. Added synthetic datasets for `MovieLensALS`, `LinearRegression`, `BinaryClassification`. 2. Embedded instructions in the help message of those example apps. Per discussion with Matei on the JIRA page, new example data is under `data/mllib`. Author: Xiangrui Meng Closes #833 from mengxr/mllib-sample-data and squashes the following commits: 59f0a18 [Xiangrui Meng] add sample binary classification data 3c2f92f [Xiangrui Meng] add linear regression data 050f1ca [Xiangrui Meng] add a sample dataset for MovieLensALS example (cherry picked from commit bcb9dce6f444a977c714117811bce0c54b417650) Signed-off-by: Tathagata Das Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/1c6c8b5b Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/1c6c8b5b Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/1c6c8b5b Branch: refs/heads/branch-1.0 Commit: 1c6c8b5bd2bdecfc5fdabd33ee8762fe25b0e69a Parents: 78b6e6f Author: Xiangrui Meng Authored: Mon May 19 21:29:33 2014 -0700 Committer: Tathagata Das Committed: Mon May 19 21:29:50 2014 -0700 ---------------------------------------------------------------------- .../mllib/sample_binary_classification_data.txt | 100 ++ data/mllib/sample_linear_regression_data.txt | 501 ++++++ data/mllib/sample_movielens_data.txt | 1501 ++++++++++++++++++ .../examples/mllib/BinaryClassification.scala | 12 +- .../spark/examples/mllib/LinearRegression.scala | 11 +- .../spark/examples/mllib/MovieLensALS.scala | 15 + 6 files changed, 2138 insertions(+), 2 deletions(-) ----------------------------------------------------------------------