mahout-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From rawkintr...@apache.org
Subject [1/2] mahout git commit: NO-JIRA Add ASFv2 License
Date Wed, 03 May 2017 05:08:51 GMT
Repository: mahout
Updated Branches:
  refs/heads/website 516e3fb9a -> 0b38f5167


NO-JIRA Add ASFv2 License


Project: http://git-wip-us.apache.org/repos/asf/mahout/repo
Commit: http://git-wip-us.apache.org/repos/asf/mahout/commit/c8bdf2ef
Tree: http://git-wip-us.apache.org/repos/asf/mahout/tree/c8bdf2ef
Diff: http://git-wip-us.apache.org/repos/asf/mahout/diff/c8bdf2ef

Branch: refs/heads/website
Commit: c8bdf2efc66d9dbba07d0c50d8369fb721581149
Parents: 516e3fb
Author: rawkintrevo <trevor.d.grant@gmail.com>
Authored: Wed May 3 00:07:22 2017 -0500
Committer: rawkintrevo <trevor.d.grant@gmail.com>
Committed: Wed May 3 00:07:22 2017 -0500

----------------------------------------------------------------------
 examples/bin/SparseSparseDrmTimer.mscala        | 17 +++++++++-
 website/docs/algorithms/linear-algebra/index.md |  0
 .../docs/algorithms/preprocessors/template.md   | 20 ------------
 website/docs/algorithms/reccomenders/index.md   | 33 ++++++++++++++++++++
 .../reccomenders/recommender-overview.md        | 33 --------------------
 .../algorithms/regression/cochrane-orcutt.md    |  9 ------
 .../algorithms/regression/fittness-tests.md     |  0
 website/docs/algorithms/regression/index.md     |  0
 .../serial-correlation/cochrane-orcutt.md       |  9 ++++++
 .../regression/serial-correlation/dw-test.md    |  0
 website/docs/algorithms/template.md             | 20 ++++++++++++
 .../docs/tutorials/mahout-in-zeppelin/index.md  |  0
 12 files changed, 78 insertions(+), 63 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/mahout/blob/c8bdf2ef/examples/bin/SparseSparseDrmTimer.mscala
----------------------------------------------------------------------
diff --git a/examples/bin/SparseSparseDrmTimer.mscala b/examples/bin/SparseSparseDrmTimer.mscala
index 947dd73..3cd61d5 100644
--- a/examples/bin/SparseSparseDrmTimer.mscala
+++ b/examples/bin/SparseSparseDrmTimer.mscala
@@ -1,4 +1,19 @@
-
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *     http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+*/
 
 def timeSparseDRMMMul(m: Int, n: Int, s: Int, para: Int, pctDense: Double = .20, seed: Long
= 1234L): Long = {
 

http://git-wip-us.apache.org/repos/asf/mahout/blob/c8bdf2ef/website/docs/algorithms/linear-algebra/index.md
----------------------------------------------------------------------
diff --git a/website/docs/algorithms/linear-algebra/index.md b/website/docs/algorithms/linear-algebra/index.md
new file mode 100644
index 0000000..e69de29

http://git-wip-us.apache.org/repos/asf/mahout/blob/c8bdf2ef/website/docs/algorithms/preprocessors/template.md
----------------------------------------------------------------------
diff --git a/website/docs/algorithms/preprocessors/template.md b/website/docs/algorithms/preprocessors/template.md
deleted file mode 100644
index 4a48829..0000000
--- a/website/docs/algorithms/preprocessors/template.md
+++ /dev/null
@@ -1,20 +0,0 @@
----
-layout: algorithm
-title: AsFactor
-theme:
-    name: mahout2
----
-
-TODO: Fill this out!
-Stub
-
-### About
-
-### Parameters
-
-### Example
-
-
-
-
-

http://git-wip-us.apache.org/repos/asf/mahout/blob/c8bdf2ef/website/docs/algorithms/reccomenders/index.md
----------------------------------------------------------------------
diff --git a/website/docs/algorithms/reccomenders/index.md b/website/docs/algorithms/reccomenders/index.md
new file mode 100644
index 0000000..00d8ec4
--- /dev/null
+++ b/website/docs/algorithms/reccomenders/index.md
@@ -0,0 +1,33 @@
+---
+layout: algorithm
+title: Recommender Quickstart
+theme:
+    name: retro-mahout
+---
+
+
+# Recommender Overview
+
+Recommenders have changed over the years. Mahout contains a long list of them, which you
can still use. But to get the best  out of our more modern aproach we'll need to think of
the Recommender as a "model creation" component&mdash;supplied by Mahout's new spark-itemsimilarity
job, and a "serving" component&mdash;supplied by a modern scalable search engine, like
Solr.
+
+![image](http://i.imgur.com/fliHMBo.png)
+
+To integrate with your application you will collect user interactions storing them in a DB
and also in a from usable by Mahout. The simplest way to do this is to log user interactions
to csv files (user-id, item-id). The DB should be setup to contain the last n user interactions,
which will form part of the query for recommendations.
+
+Mahout's spark-itemsimilarity will create a table of (item-id, list-of-similar-items) in
csv form. Think of this as an item collection with one field containing the item-ids of similar
items. Index this with your search engine. 
+
+When your application needs recommendations for a specific person, get the latest user history
of interactions from the DB and query the indicator collection with this history. You will
get back an ordered list of item-ids. These are your recommendations. You may wish to filter
out any that the user has already seen but that will depend on your use case.
+
+All ids for users and items are preserved as string tokens and so work as an external key
in DBs or as doc ids for search engines, they also work as tokens for search queries.
+
+##References
+
+1. A free ebook, which talks about the general idea: [Practical Machine Learning](https://www.mapr.com/practical-machine-learning)
+2. A slide deck, which talks about mixing actions or other indicators: [Creating a Multimodal
Recommender with Mahout and a Search Engine](http://occamsmachete.com/ml/2014/10/07/creating-a-unified-recommender-with-mahout-and-a-search-engine/)
+3. Two blog posts: [What's New in Recommenders: part #1](http://occamsmachete.com/ml/2014/08/11/mahout-on-spark-whats-new-in-recommenders/)
+and  [What's New in Recommenders: part #2](http://occamsmachete.com/ml/2014/09/09/mahout-on-spark-whats-new-in-recommenders-part-2/)
+3. A post describing the loglikelihood ratio:  [Surprise and Coinsidense](http://tdunning.blogspot.com/2008/03/surprise-and-coincidence.html)
 LLR is used to reduce noise in the data while keeping the calculations O(n) complexity.
+
+##Mahout Model Creation
+
+See the page describing [*spark-itemsimilarity*](http://mahout.apache.org/users/recommender/intro-cooccurrence-spark.html)
for more details.
\ No newline at end of file

http://git-wip-us.apache.org/repos/asf/mahout/blob/c8bdf2ef/website/docs/algorithms/reccomenders/recommender-overview.md
----------------------------------------------------------------------
diff --git a/website/docs/algorithms/reccomenders/recommender-overview.md b/website/docs/algorithms/reccomenders/recommender-overview.md
deleted file mode 100644
index 00d8ec4..0000000
--- a/website/docs/algorithms/reccomenders/recommender-overview.md
+++ /dev/null
@@ -1,33 +0,0 @@
----
-layout: algorithm
-title: Recommender Quickstart
-theme:
-    name: retro-mahout
----
-
-
-# Recommender Overview
-
-Recommenders have changed over the years. Mahout contains a long list of them, which you
can still use. But to get the best  out of our more modern aproach we'll need to think of
the Recommender as a "model creation" component&mdash;supplied by Mahout's new spark-itemsimilarity
job, and a "serving" component&mdash;supplied by a modern scalable search engine, like
Solr.
-
-![image](http://i.imgur.com/fliHMBo.png)
-
-To integrate with your application you will collect user interactions storing them in a DB
and also in a from usable by Mahout. The simplest way to do this is to log user interactions
to csv files (user-id, item-id). The DB should be setup to contain the last n user interactions,
which will form part of the query for recommendations.
-
-Mahout's spark-itemsimilarity will create a table of (item-id, list-of-similar-items) in
csv form. Think of this as an item collection with one field containing the item-ids of similar
items. Index this with your search engine. 
-
-When your application needs recommendations for a specific person, get the latest user history
of interactions from the DB and query the indicator collection with this history. You will
get back an ordered list of item-ids. These are your recommendations. You may wish to filter
out any that the user has already seen but that will depend on your use case.
-
-All ids for users and items are preserved as string tokens and so work as an external key
in DBs or as doc ids for search engines, they also work as tokens for search queries.
-
-##References
-
-1. A free ebook, which talks about the general idea: [Practical Machine Learning](https://www.mapr.com/practical-machine-learning)
-2. A slide deck, which talks about mixing actions or other indicators: [Creating a Multimodal
Recommender with Mahout and a Search Engine](http://occamsmachete.com/ml/2014/10/07/creating-a-unified-recommender-with-mahout-and-a-search-engine/)
-3. Two blog posts: [What's New in Recommenders: part #1](http://occamsmachete.com/ml/2014/08/11/mahout-on-spark-whats-new-in-recommenders/)
-and  [What's New in Recommenders: part #2](http://occamsmachete.com/ml/2014/09/09/mahout-on-spark-whats-new-in-recommenders-part-2/)
-3. A post describing the loglikelihood ratio:  [Surprise and Coinsidense](http://tdunning.blogspot.com/2008/03/surprise-and-coincidence.html)
 LLR is used to reduce noise in the data while keeping the calculations O(n) complexity.
-
-##Mahout Model Creation
-
-See the page describing [*spark-itemsimilarity*](http://mahout.apache.org/users/recommender/intro-cooccurrence-spark.html)
for more details.
\ No newline at end of file

http://git-wip-us.apache.org/repos/asf/mahout/blob/c8bdf2ef/website/docs/algorithms/regression/cochrane-orcutt.md
----------------------------------------------------------------------
diff --git a/website/docs/algorithms/regression/cochrane-orcutt.md b/website/docs/algorithms/regression/cochrane-orcutt.md
deleted file mode 100644
index a88a0a0..0000000
--- a/website/docs/algorithms/regression/cochrane-orcutt.md
+++ /dev/null
@@ -1,9 +0,0 @@
----
-layout: algorithm
-title: Cochrane-Orcutt Procedure
-theme:
-    name: mahout2
----
-
-TODO: Fill this out!
-Stub
\ No newline at end of file

http://git-wip-us.apache.org/repos/asf/mahout/blob/c8bdf2ef/website/docs/algorithms/regression/fittness-tests.md
----------------------------------------------------------------------
diff --git a/website/docs/algorithms/regression/fittness-tests.md b/website/docs/algorithms/regression/fittness-tests.md
new file mode 100644
index 0000000..e69de29

http://git-wip-us.apache.org/repos/asf/mahout/blob/c8bdf2ef/website/docs/algorithms/regression/index.md
----------------------------------------------------------------------
diff --git a/website/docs/algorithms/regression/index.md b/website/docs/algorithms/regression/index.md
new file mode 100644
index 0000000..e69de29

http://git-wip-us.apache.org/repos/asf/mahout/blob/c8bdf2ef/website/docs/algorithms/regression/serial-correlation/cochrane-orcutt.md
----------------------------------------------------------------------
diff --git a/website/docs/algorithms/regression/serial-correlation/cochrane-orcutt.md b/website/docs/algorithms/regression/serial-correlation/cochrane-orcutt.md
new file mode 100644
index 0000000..a88a0a0
--- /dev/null
+++ b/website/docs/algorithms/regression/serial-correlation/cochrane-orcutt.md
@@ -0,0 +1,9 @@
+---
+layout: algorithm
+title: Cochrane-Orcutt Procedure
+theme:
+    name: mahout2
+---
+
+TODO: Fill this out!
+Stub
\ No newline at end of file

http://git-wip-us.apache.org/repos/asf/mahout/blob/c8bdf2ef/website/docs/algorithms/regression/serial-correlation/dw-test.md
----------------------------------------------------------------------
diff --git a/website/docs/algorithms/regression/serial-correlation/dw-test.md b/website/docs/algorithms/regression/serial-correlation/dw-test.md
new file mode 100644
index 0000000..e69de29

http://git-wip-us.apache.org/repos/asf/mahout/blob/c8bdf2ef/website/docs/algorithms/template.md
----------------------------------------------------------------------
diff --git a/website/docs/algorithms/template.md b/website/docs/algorithms/template.md
new file mode 100644
index 0000000..4a48829
--- /dev/null
+++ b/website/docs/algorithms/template.md
@@ -0,0 +1,20 @@
+---
+layout: algorithm
+title: AsFactor
+theme:
+    name: mahout2
+---
+
+TODO: Fill this out!
+Stub
+
+### About
+
+### Parameters
+
+### Example
+
+
+
+
+

http://git-wip-us.apache.org/repos/asf/mahout/blob/c8bdf2ef/website/docs/tutorials/mahout-in-zeppelin/index.md
----------------------------------------------------------------------
diff --git a/website/docs/tutorials/mahout-in-zeppelin/index.md b/website/docs/tutorials/mahout-in-zeppelin/index.md
new file mode 100644
index 0000000..e69de29


Mime
View raw message