Return-Path: X-Original-To: apmail-mahout-commits-archive@www.apache.org Delivered-To: apmail-mahout-commits-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0ACD9C5B5 for ; Sun, 3 Nov 2013 21:37:10 +0000 (UTC) Received: (qmail 51880 invoked by uid 500); 3 Nov 2013 21:37:09 -0000 Delivered-To: apmail-mahout-commits-archive@mahout.apache.org Received: (qmail 51832 invoked by uid 500); 3 Nov 2013 21:37:09 -0000 Mailing-List: contact commits-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@mahout.apache.org Delivered-To: mailing list commits@mahout.apache.org Received: (qmail 51825 invoked by uid 99); 3 Nov 2013 21:37:09 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 03 Nov 2013 21:37:09 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO eris.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 03 Nov 2013 21:36:55 +0000 Received: from eris.apache.org (localhost [127.0.0.1]) by eris.apache.org (Postfix) with ESMTP id 6839A2388A67; Sun, 3 Nov 2013 21:36:32 +0000 (UTC) Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Subject: svn commit: r1538467 [4/20] - in /mahout/site/mahout_cms: ./ cgi-bin/ content/ content/css/ content/developers/ content/general/ content/images/ content/js/ content/users/ content/users/basics/ content/users/classification/ content/users/clustering/ co... Date: Sun, 03 Nov 2013 21:36:27 -0000 To: commits@mahout.apache.org From: isabel@apache.org X-Mailer: svnmailer-1.0.9 Message-Id: <20131103213632.6839A2388A67@eris.apache.org> X-Virus-Checked: Checked by ClamAV on apache.org Added: mahout/site/mahout_cms/content/general/faq.mdtext URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/general/faq.mdtext?rev=1538467&view=auto ============================================================================== --- mahout/site/mahout_cms/content/general/faq.mdtext (added) +++ mahout/site/mahout_cms/content/general/faq.mdtext Sun Nov 3 21:36:23 2013 @@ -0,0 +1,77 @@ +Title: FAQ +The Official Mahout FAQ + +*General* + +1. [What is Apache Mahout?](#whatis.html) +1. [What does the name mean?](#mean.html) +1. [Where can I find the origins of the Mahout project? ](#historical.html) +1. [Where can I download Mahout logo? ](#logo.html) +1. [Where can I download Mahout slide presentations? ](#presentations.html) + +*Algorithms* +1. [What algorithms are implemented in Mahout?](#algos.html) +1. [What algorithms are missing from Mahout?](#todo.html) + + + +# *Answers* + + + +## General + + + +### {anchor:whatIs}What is Apache Mahout? + +Apache Mahout is a suite of machine learning libraries designed to be +scalable and robust + + +### {anchor:mean}What does the name mean? + +The name [Mahout](http://en.wikipedia.org/wiki/Mahout) + was original chosen for it's association with the [Apache Hadoop|http://hadoop.apache.org] + project. A Mahout is a person who drives an elephant (hint: Hadoop's logo +is an elephant.) We just wanted a name that complemented Hadoop but we see +our project as a good driver of Hadoop in the sense that we will be using +and testing it. We are not, however, implying that we are controlling +Hadoop's development. + + +### {anchor:historical}Where can I find the origins of the Mahout project? + +See [http://ml-site.grantingersoll.com](http://ml-site.grantingersoll.com) + for old wiki and mailing list archives (all read-only) + + +### {anchor:logo}Where can I download Mahout logo? + +See [MAHOUT-335](https://issues.apache.org/jira/browse/MAHOUT-335) + + + +### {anchor:presentations} Where can I download Mahout slide presentations? + +* [Books, Tutorials and Talks ](https://cwiki.apache.org/confluence/display/MAHOUT/Books+Tutorials+and+Talks) + Wiki page containing an overview of all presentations with links to slides +where available. + + +## Algorithms + + +### {anchor:algos} What algorithms are implemented in Mahout? + +We are interested in a wide variety of machine learning algorithms. Many of +which are already implemented in Mahout. You can find them [here ](http://cwiki.apache.org/confluence/display/MAHOUT/Algorithms) +. + + +### {anchor:todo} What algorithms are missing from Mahout? + +There are many machine learning algorithms that we would like to have in +Mahout, including some from the paper [http://www.cs.stanford.edu/people/ang/papers/nips06-mapreducemulticore.pdf](http://www.cs.stanford.edu/people/ang/papers/nips06-mapreducemulticore.pdf) +. If you have an algorithm or an improvement to an algorithm that you would +like to implement, by all means submit a patch. Added: mahout/site/mahout_cms/content/general/glossary.mdtext URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/general/glossary.mdtext?rev=1538467&view=auto ============================================================================== --- mahout/site/mahout_cms/content/general/glossary.mdtext (added) +++ mahout/site/mahout_cms/content/general/glossary.mdtext Sun Nov 3 21:36:23 2013 @@ -0,0 +1,6 @@ +Title: Glossary +This is a list of common glossary terms used on both the mailing lists and +around the site. Where possible I have tried to provide a link to more +in-depth explanations from the web + +{children:excerpt=true|style=h4} Added: mahout/site/mahout_cms/content/general/mahout-benchmarks.mdtext URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/general/mahout-benchmarks.mdtext?rev=1538467&view=auto ============================================================================== --- mahout/site/mahout_cms/content/general/mahout-benchmarks.mdtext (added) +++ mahout/site/mahout_cms/content/general/mahout-benchmarks.mdtext Sun Nov 3 21:36:23 2013 @@ -0,0 +1,152 @@ +Title: Mahout Benchmarks + +# Introduction + +TODO: YMMV + + +# Recommenders + + +## A Rule of Thumb + +100M preferences are about the data set size where non-distributed +recommenders will outgrow a normal-sized machine (32-bit, <= 4GB RAM). Your +mileage will vary significantly with the nature of the data. + + +## Distributed recommender vs. Wikipedia links (May 27 2010) + +From the mailing list: + +I just finished running a set of recommendations based on the Wikipedia +link graph, for book purposes (yeah, it's unconventional). I ran on my +laptop, but it ought to be crudely representative of how it runs in a real +cluster. + +The input is 1058MB as a text file, and contains, 130M article-article +associations, from 5.7M articles to 3.8M distinct articles ("users" and +"items", respectively). I estimate cost based on Amazon's North +American small Linux-based instance pricing of $0.085/hour. I ran on a +dual-core laptop with plenty of RAM, allowing 1GB per worker, so this is +valid. + +In this run, I run recommendations for all 5.7M "users". You can certainly +run for any subset of all users of course. + +Phase 1 (Item ID to item index mapping) +29 minutes CPU time +$0.05 +60MB output + +Phase 2 (Create user vectors) +88 minutes CPU time +$0.13 +Output: 1159MB + +Phase 3 (Count co-occurrence) +77 hours CPU time +$6.54 +Output: 23.6GB + +Phase 4 (Partial multiply prep) +10.5 hours CPU time +$0.90 +Output: 24.6GB + +Phase 5 (Aggregate and recommend) +about 600 hours +about $51.00 +about 10GB +(I estimated these rather than let it run at home for days!) + + +Note that phases 1 and 3 may be run less frequently, and need not be run +every time. But the cost is dominated by the last step, which is most of +the work. I've ignored storage costs. + +This implies a cost of $0.01 (or about 8 instance-minutes) per 1,000 user +recommendations. That's not bad if, say, you want to update recs for you +site's 100,000 daily active users for a dollar. + +There are several levers one could pull internally to sacrifice accuracy +for speed, but it's currently set to pretty normal values. So this is just +one possibility. + +Now that's not terrible, but it is about 8x more computing than would be +needed by a non-distributed implementation *if* you could fit the whole +data set into a very large instance's memory, which is still possible at +this scale but needs a pretty big instance. That's a very apples-to-oranges +comparison of course; different algorithms, entirely different +environments. This is about the amount of overhead I'd expect from +distributing -- interesting to note how non-trivial it is. + + +## Non-distributed recommender vs. KDD Cup data set (March 2011) + +(From the user@mahout.apache.org mailing list) + +I've been test-driving a simple application of Mahout recommenders (the +non-distributed kind) on Amazon EC2 on the new Yahoo KDD Cup data set +(kddcup.yahoo.com). + +In the spirit of open-source, like I mentioned, I'm committing the extra +code to mahout-examples that can be used to run a Recommender on the input +and output the right format. And, I'd like to publish the rough timings +too. Find all the source in org.apache.mahout.cf.taste.example.kddcup + + +### Track 1 + +* m2.2xlarge instance, 34.2GB RAM / 4 cores +* Steady state memory consumption: ~19GB +* Computation time: 30 hours (wall clock-time) +* CPU time per user: ~0.43 sec +* Cost on EC2: $34.20 (!) + +(Helpful hint on cost I realized after the fact: you can almost surely get +spot instances for cheaper. The maximum price this sort of instance has +gone for as a spot instance is about $0.60/hour, vs "retail price" of +$1.14/hour.) + +Resulted in an RMSE of 29.5618 (the rating scale is 0-100), which is only +good enough for 29th place at the moment. Not terrible for "out of the box" +performance -- it's just using an item-based recommender with uncentered +cosine similarity. But not really good in absolute terms. A winning +solution is going to try to factor in time, and apply more sophisticated +techniques. The best RMSE so far is about 23. + + +### Track 2 + +* c1.xlarge instance: 7GB RAM / 8 cores +* Steady state memory consumption: ~3.8GB +* Computation time: 4.1 hours (wall clock-time) +* CPU time per user: ~1.1 sec +* Cost on EC2: $3.20 + +For this I bothered to write a simplistic item-item similarity metric to +take into account the additional info that is available: track, artist, +album, genre. The result was comparatively better: 17.92% error rate, good +enough for 4th place at the moment. + +Of course, the next task is to put this through the actual distributed +processing -- that's really the appropriate solution. + +This shows you can still tackle fairly impressive scale with a +non-distributed solution. These results suggest that the largest instances +available from EC2 would accomodate almost 1 billion ratings in memory. +However at that scale running a user's full recommendations would easily be +measured in seconds, not milliseconds. + + +# Clustering + +See [MAHOUT-588](https://issues.apache.org/jira/browse/MAHOUT-588) + + +# Classification + + +# Frequent Patternset Mining + Added: mahout/site/mahout_cms/content/general/mahout-wiki.mdtext URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/general/mahout-wiki.mdtext?rev=1538467&view=auto ============================================================================== --- mahout/site/mahout_cms/content/general/mahout-wiki.mdtext (added) +++ mahout/site/mahout_cms/content/general/mahout-wiki.mdtext Sun Nov 3 21:36:23 2013 @@ -0,0 +1,194 @@ +Title: Mahout Wiki +Apache Mahout is a new Apache TLP project to create scalable, machine +learning algorithms under the Apache license. + +{toc:style=disc|minlevel=2} + + +## General +[Overview](overview.html) + -- Mahout? What's that supposed to be? + +[Quickstart](quickstart.html) + -- learn how to quickly setup Apache Mahout for your project. + +[FAQ](faq.html) + -- Frequent questions encountered on the mailing lists. + +[Developer Resources](developer-resources.html) + -- overview of the Mahout development infrastructure. + +[How To Contribute](how-to-contribute.html) + -- get involved with the Mahout community. + +[How To Become A Committer](how-to-become-a-committer.html) + -- become a member of the Mahout development community. + +[Hadoop](http://hadoop.apache.org) + -- several of our implementations depend on Hadoop. + +[Machine Learning Open Source Software](http://mloss.org/software/) + -- other projects implementing Open Source Machine Learning libraries. + +[Mahout -- The name, history and its pronunciation](mahoutname.html) + + +## Community + +[Who we are](who-we-are.html) + -- who are the developers behind Apache Mahout? + +[Books, Tutorials, Talks, Articles, News, Background Reading, etc. on Mahout](books-tutorials-and-talks.html) + +[Issue Tracker](issue-tracker.html) + -- see what features people are working on, submit patches and file bugs. + +[Source Code (SVN)](https://svn.apache.org/repos/asf/mahout/) + -- [Fisheye|http://fisheye6.atlassian.com/browse/mahout] + -- download the Mahout source code from svn. + +[Mailing lists and IRC](mailing-lists,-irc-and-archives.html) + -- links to our mailing lists, IRC channel and archived design and +algorithm discussions, maybe your questions was answered there already? + +[Version Control](version-control.html) + -- where we track our code. + +[Powered By Mahout](powered-by-mahout.html) + -- who is using Mahout in production? + +[Professional Support](professional-support.html) + -- who is offering professional support for Mahout? + +[Mahout and Google Summer of Code](gsoc.html) + -- All you need to know about Mahout and GSoC. + + +[Glossary of commonly used terms and abbreviations](glossary.html) + + +## Installation/Setup + +[System Requirements](system-requirements.html) + -- what do you need to run Mahout? + +[Quickstart](quickstart.html) + -- get started with Mahout, run the examples and get pointers to further +resources. + +[Downloads](downloads.html) + -- a list of Mahout releases. + +[Download and installation](buildingmahout.html) + -- build Mahout from the sources. + +[Mahout on Amazon's EC2 Service](mahout-on-amazon-ec2.html) + -- run Mahout on Amazon's EC2. + +[Mahout on Amazon's EMR](mahout-on-elastic-mapreduce.html) + -- Run Mahout on Amazon's Elastic Map Reduce + +[Integrating Mahout into an Application](mahoutintegration.html) + -- integrate Mahout's capabilities in your application. + + +## Examples + +1. [ASF Email Examples](asfemail.html) + -- Examples of recommenders, clustering and classification all using a +public domain collection of 7 million emails. + + +## Implementation Background + + +### Requirements and Design + +[Matrix and Vector Needs](matrix-and-vector-needs.html) + -- requirements for Mahout vectors. + +[Collection(De-)Serialization](collection(de-)serialization.html) + + +### Collections and Algorithms + +Learn more about [mahout-collections](mahout-collections.html) +, containers for efficient storage of primitive-type data and open hash +tables. + +Learn more about the [Algorithms](algorithms.html) + discussed and employed by Mahout. + +Learn more about the [Mahout recommender implementation](recommender-documentation.html) +. + + +### Utilities + +This section describes tools that might be useful for working with Mahout. + +[Converting Content](converting-content.html) + -- Mahout has some utilities for converting content such as logs to +formats more amenable for consumption by Mahout. +[Creating Vectors](creating-vectors.html) + -- Mahout's algorithms operate on vectors. Learn more on how to generate +these from raw data. +[Viewing Result](viewing-result.html) + -- How to visualize the result of your trained algorithms. + + +### Data + +[Collections](collections.html) + -- To try out and test Mahout's algorithms you need training data. We are +always looking for new training data collections. + + +### Benchmarks + +[Mahout Benchmarks](mahout-benchmarks.html) + + +## Committer's Resources + +* [Testing](testing.html) + -- Information on test plans and ideas for testing + + +### Project Resources + +* [Dealing with Third Party Dependencies not in Maven](thirdparty-dependencies.html) +* [How To Update The Website](how-to-update-the-website.html) +* [Patch Check List](patch-check-list.html) +* [How To Release](http://cwiki.apache.org/confluence/display/MAHOUT/How+to+release) +* [Release Planning](release-planning.html) +* [Sonar Code Quality Analysis](https://analysis.apache.org/dashboard/index/63921) + + +### Additional Resources + +* [Apache Machine Status](http://monitoring.apache.org/status/) + \- Check to see if SVN, other resources are available. +* [Committer's FAQ](http://www.apache.org/dev/committers.html) +* [Apache Dev](http://www.apache.org/dev/) + + + +## How To Edit This Wiki + +How to edit this Wiki + +This Wiki is a collaborative site, anyone can contribute and share: + +* Create an account by clicking the "Login" link at the top of any page, +and picking a username and password. +* Edit any page by pressing Edit at the top of the page + +There are some conventions used on the Mahout wiki: + + * {noformat}+*TODO:*+{noformat} (+*TODO:*+ ) is used to denote sections +that definitely need to be cleaned up. + * {noformat}+*Mahout_(version)*+{noformat} (+*Mahout_0.2*+) is used to +draw attention to which version of Mahout a feature was (or will be) added +to Mahout. + Added: mahout/site/mahout_cms/content/general/mailing-lists,-irc-and-archives.mdtext URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/general/mailing-lists%2C-irc-and-archives.mdtext?rev=1538467&view=auto ============================================================================== --- mahout/site/mahout_cms/content/general/mailing-lists,-irc-and-archives.mdtext (added) +++ mahout/site/mahout_cms/content/general/mailing-lists,-irc-and-archives.mdtext Sun Nov 3 21:36:23 2013 @@ -0,0 +1,83 @@ +Title: Mailing Lists, IRC and Archives + +# General + +Communication at Mahout happens primarily online via mailing lists. We have +a user as well as a dev list for discussion. In addition there is a commit +list so we are able to monitor what happens on the wiki and in svn. + + +# Mailing lists + + +## Mahout User List + +This list is for users of Mahout to ask questions, share knowledge, and +discuss issues. Do send mail to this list with usage and configuration +questions and problems. Also, please send questions to this list to verify +your problem before filing issues in JIRA. + +* [Subscribe](mailto:mahout-user-subscribe@apache.org.html) +* [Unsubscribe](mailto:mahout-user-unsubscribe@apache.org.html) + + +## Mahout Developer List + +This is the list where participating developers of the Mahout project meet +and discuss issues concerning Mahout internals, code changes/additions, +etc. Do not send mail to this list with usage questions or configuration +questions and problems. + +Discussion list: + +* [Subscribe](mailto:mahout-dev-subscribe@apache.org.html) + -- Do not send mail to this list with usage questions or configuration +questions and problems. +* [Unsubscribe](mailto:mahout-dev-unsubscribe@apache.org.html) + +Commit notifications: + +* [Subscribe](mailto:mahout-commits-subscribe@apache.org.html) +* [Unsubscribe](mailto:mahout-commits-unsubscribe@apache.org.html) + + +# IRC + +Mahout's IRC channel is #mahout. It is a logged channel. Please keep in +mind that it is for discussion purposes only and that (pseudo)decisions +should be brought back to the dev@ mailing list or JIRA and other people +who are not on IRC should be given time to respond before any work is +committed. + + +# Archives + + +## Official Apache Archive + +* [http://mail-archives.apache.org/mod_mbox/mahout-dev/](http://mail-archives.apache.org/mod_mbox/mahout-dev/) +* [http://mail-archives.apache.org/mod_mbox/mahout-user/](http://mail-archives.apache.org/mod_mbox/mahout-user/) + +* [Mbox Archive](http://mahout.apache.org/mail/) + +Archives previous to becoming Apache top level project: + +* [http://mail-archives.apache.org/mod_mbox/lucene-mahout-dev/](http://mail-archives.apache.org/mod_mbox/lucene-mahout-dev/) +* [http://mail-archives.apache.org/mod_mbox/lucene-mahout-user/](http://mail-archives.apache.org/mod_mbox/lucene-mahout-user/) + +* [Mbox Archive](http://lucene.apache.org/mail/) + + +## External Archives + +* [http://www.lucidimagination.com/search](http://www.lucidimagination.com/search) + - Search the entire Lucene ecosystem, including Mahout (archives, JIRA, +etc.) Powered by Solr/Lucene. +* [MarkMail](http://mahout.markmail.org/) +* [Nabble](http://www.nabble.com/Apache-Mahout-f32040.html) +* [Gmane](http://dir.gmane.org/gmane.comp.apache.mahout.user) + +Please note the inclusion of a link to an archive does not imply an +endorsement of that company by any of the committers of Mahout the Lucene +PMC or the Apache Software Foundation. Each archive owner is solely +responsible for the contents and availability of their archive. Added: mahout/site/mahout_cms/content/general/powered-by-mahout.mdtext URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/general/powered-by-mahout.mdtext?rev=1538467&view=auto ============================================================================== --- mahout/site/mahout_cms/content/general/powered-by-mahout.mdtext (added) +++ mahout/site/mahout_cms/content/general/powered-by-mahout.mdtext Sun Nov 3 21:36:23 2013 @@ -0,0 +1,143 @@ +Title: Powered By Mahout +* [Intro](#PoweredByMahout-Intro) +* [Commercial Use](#PoweredByMahout-CommercialUse) +* [Academic Use](#PoweredByMahout-AcademicUse) +* [Powered By Logos](#PoweredByMahout-PoweredByLogos) + + +# Intro + +Are you using Mahout to do Machine Learning? Care to share? + +*NOTE: Please add links in alphabetical order. Links here do NOT imply +endorsement by Mahout, its committers or the Apache Software Foundation and +are for informational purposes only.* + + +# Commercial Use + +* Adobe AMP uses Mahout's clustering algorithms to increase video +consumption by better user targeting. See [http://nosql.mypopescu.com/post/2082712431/hbase-and-hadoop-at-adobe](http://nosql.mypopescu.com/post/2082712431/hbase-and-hadoop-at-adobe) +* Accenture uses Mahout as typical example for their [Hadoop Deployment Comparison Study](http://www.accenture.com/SiteCollectionDocuments/PDF/Accenture-Hadoop-Deployment-Comparison-Study.pdf) +* Amazon's Personalization Platform -- See [http://www.linkedin.com/groups/Apache-Mahout-2182513](http://www.linkedin.com/groups/Apache-Mahout-2182513) +* [AOL ](http://www.aol.com) + use Mahout for shopping recommendations. See [http://www.slideshare.net/kryton/the-data-layer] +* [Booz Allen Hamilton ](http://www.boozallen.com/) + uses Mahout's clustering algorithms. See [http://www.slideshare.net/ydn/3-biometric-hadoopsummit2010] +* [Buzzlogic](http://www.buzzlogic.com) + uses Mahout's clustering algorithms to improve ad targeting +* [Cull.tv](http://cull.tv/) + uses modified Mahout algorithms for content recommendations +* !http://cdn.dataminelab.com/favicon.ico! [DataMine Lab](http://dataminelab.com) + uses Mahout's recommendation and clustering algorithms to improve our +clients' ad targeting. +* [Drupal](http://drupal.org/project/recommender) + users Mahout to provide open source content recommendation solutions. +* [Evolv ](http://www.evolvondemand.com) + uses Mahout for its Workforce Predictive Analytics platform. +* [Foursquare](http://www.foursquare.com) + uses Mahout for its [recommendation engine |http://engineering.foursquare.com/2011/03/22/building-a-recommendation-engine-foursquare-style/] +. +* [Idealo](http://www.idealo.de) + uses Mahout's recommendation engine. +* [InfoGlutton](http://www.infoglutton.com) + uses Mahout's clustering and classification for various consulting +projects. +* [Intel](http://mark.chmarny.com/2013/07/thinking-big-about-data-at-intel.html) + ships Mahout as part of their Distribution for Apache Hadoop Software. +* [Intela](http://www.intela.com/) + has implementations of Mahout's recommendation algorithms to select new +offers to send tu customers, as well as to recommend potential customers to +current offers. We are also working on enhancing our offer categories by +using the clustering algorithms. We have a [blog post|http://intela.com/best-practices/intela-gets-smarter] + where we talk about it. +* !http://ioffer.com/favicon.ico! [iOffer](http://www.ioffer.com) + uses Mahout's Frequent Pattern Mining and Collaborative Filtering to +recommend items to users. +* !http://kau.li/favicon.ico! [Kauli](http://kau.li/en) +, one of Japanese Adnetwork, uses Mahout's clustering to handle clickstream +data for predicting audience's interests and intents. +* [Linked.In](http://linkedin.com) + Historically, we have used R for model training. We have recently started +experimenting with Mahout for model training and are excited about it - see +also [Hadoop World slides|https://www.quora.com/LinkedIn-Recommendations/How-does-LinkedIns-recommendation-system-work?srid=XoeG&share=1] +. +* [LucidWorks Big Data](http://www.lucidworks.com/products/lucidworks-big-data) + uses Mahout for clustering, duplicate document detection, phrase +extraction and classification. +* !http://mendeley.com/favicon.ico! [Mendeley](http://mendeley.com) + uses Mahout to power Mendeley Suggest, a research article recommendation +service. +* !http://mippin.com/favicon.ico! [Mippin](http://mippin.com) + uses Mahout's collaborative filtering engine to recommend news feeds +* [Mobage](http://www.slideshare.net/hamadakoichi/mobage-prmu-2011-mahout-hadoop) + uses Mahout in their analysis pipeline +* !http://myrrix.com/wp-content/uploads/2012/03/favicon.ico! [Myrrix](http://myrrix.com) + is a recommender system product built on Mahout. +* !http://www.newscred.com/media/img/favicon.ico! [NewsCred](http://platform.newscred.com) + uses Mahout to generate clusters of news articles and to surface the +important stories of the day +* [Next Glass](http://nextglass.co/) + uses Mahout [http://nextglass.co/who-we-are.html#who-3] +* [Predixion Software](http://predixionsoftware.com/predixion/NewsEvents/articleType/ArticleView/articleId/51/Predixion-Software-Demonstrating-Apache-Mahout-Integration-on-Hadoop-at-EMC-World-2012.aspx) + uses Mahout’s algorithms to build predictive models on big data +* !http://blog.radoop.eu/favicon.ico! [Radoop](http://radoop.eu) + provides a drag-n-drop interface for big data analytics, including Mahout +clustering and classification algorithms +* !https://www.researchgate.net/favicon.ico! [ResearchGate](http://www.researchgate.net/) +, the professional network for scientists and researchers, uses Mahout's +recommendation algorithms. +* [Sematext](http://www.sematext.com/) + uses Mahout for its [Recommendation Engine|http://www.sematext.com/products/recommendation-engine/index.html] +* [SpeedDate.com](http://www.speeddate.com) + uses Mahout's collaborative filtering engine to recommend member profiles +* [Twitter](http://twitter.com) + uses Mahout's LDA implementation for user interest modeling, and maintains +a (periodically sync'ed with Apache trunk) [fork|http://github.com/twitter/mahout] + of Mahout on GitHub +* [Yahoo\!](http://www.yahoo.com) + Mail uses Mahout's Frequent Pattern Set Mining. See [http://www.slideshare.net/hadoopusergroup/mail-antispam] +* !http://a.imageshack.us/img823/3443/logoyf.gif! [365Media ](http://365media.com/) + uses *Mahout's* Classification and Collaborative Filtering algorithms in +its Real-time system named [UPTIME|http://uptime.365media.com/] + and 365Media/Social + + +# Academic Use + +* [Dicode](https://www.dicode-project.eu/) + project uses Mahout's clustering and classification algorithms on top of +HBase. +* The course [Large Scale Data Analysis and Data Mining](http://www.dima.tu-berlin.de/menue/studium_und_lehre/aktuelles_semester_sommersemester_2011/aim_3_advanced_information_management) + at [TU Berlin|http://www.tu-berlin.de/] + uses Mahout to teach students about the parallelization of data +mining problems with Hadoop and Map/Reduce +* Mahout is used at Carnegie Mellon University, as a comparable platform to [GraphLab](http://www.graphlab.ml.cmu.edu/) +. +* The [ROBUST project](http://www.robust-project.eu/) +, co-funded by the European Commission, employs Mahout in the large scale +analysis of online community data. +* Mahout is used for research and data processing at [Nagoya Institute of Technology](http://www.nitech.ac.jp/eng/schools/grad/cse.html) +, in the context of a large-scale citizen participation platform project, +funded by the Ministry of Interior of Japan. +* Several researches within [Digital Enterprise Research Institute](http://www.deri.ie) + [NUI Galway|http://www.nuigalway.ie] + use Mahout for e.g. topic mining and modelling of large corpora. +* We used Mahout in the NoTube EU project, and it saved a lot of time (and a brain transplant). The only piece we've used heavily in our apps ([http://vimeo.com/user3487770http://notube.tv/](http://vimeo.com/user3487770http://notube.tv/) +) so far is the Taste recommender, but I've been digging deeper into the +other components. I can't claim we're a hugely famous or successful +application, but I can say without doubt I don't regret using Mahout. It +did what it said it would do, and easily. One nice thing about this +community, is that Mahout is not +over-marketed. If the nature or scale of your problem better suits other +tools, the Mahout folk will tell you so. + + +# Powered By Logos + +Feel free to use our Powered By logos on your site: + +!mahout-logo-poweredby-55.png|border=1! + + +!mahout-logo-poweredby-100.png|border=1! Added: mahout/site/mahout_cms/content/general/privacy-policy.mdtext URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/general/privacy-policy.mdtext?rev=1538467&view=auto ============================================================================== --- mahout/site/mahout_cms/content/general/privacy-policy.mdtext (added) +++ mahout/site/mahout_cms/content/general/privacy-policy.mdtext Sun Nov 3 21:36:23 2013 @@ -0,0 +1,23 @@ +Title: Privacy Policy +Information about your use of this website is collected using server access +logs and a tracking cookie. The collected information consists of the +following: + +* The IP address from which you access the website; +* The type of browser and operating system you use to access our site; +* The date and time you access our site; +* The pages you visit; and +* The addresses of pages from where you followed a link to our site. + +Part of this information is gathered using a tracking cookie set by the +Google Analytics service and handled by Google as described in their +privacy policy. See your browser documentation for instructions on how to +disable the cookie if you prefer not to share this data with Google. + +We use the gathered information to help us make our site more useful to +visitors and to better understand how and when our site is used. We do not +track or collect personally identifiable information or associate gathered +data with any personally identifying information from other sources. + +By using this website, you consent to the collection of this data in the +manner and for the purpose described above. Added: mahout/site/mahout_cms/content/general/professional-support.mdtext URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/general/professional-support.mdtext?rev=1538467&view=auto ============================================================================== --- mahout/site/mahout_cms/content/general/professional-support.mdtext (added) +++ mahout/site/mahout_cms/content/general/professional-support.mdtext Sun Nov 3 21:36:23 2013 @@ -0,0 +1,70 @@ +Title: Professional Support + +# Professional support for Mahout + +Add yourself or your company if you are offering support for Mahout +users.  Please keep lists in alphabetical order.  An entry here +is not an endorsement by the Apache Software Foundation nor any of it's +committers. + + + +## People and companies for hire + + + + + + + + [http://tutorteddy.com/site/free_statistics_help.php](http://tutorteddy.com/site/free_statistics_help.php) + | + + +Mahout, Hadoop, NoSQL Databases [http://www.clogeny.com](http://www.clogeny.com) + | + + + + + + + + +
Name contact details notes
Ted Dunning tdunning@apache.org limited availability
GridLine [http://www.gridline.nl/contact](http://www.gridline.nl/contact) + specialised in search and thesauri
LucidWorks (co-founded by Grant Ingersoll)[http://www.lucidworks.com](http://www.lucidworks.com) +Big data platform including Mahout as a service for clustering, +classification and more
Myrrix Ltd / Sean Owen [http://myrrix.com/contact](http://myrrix.com/contact) + Collaborative filtering, recommender and clustering platform
Sematext International [http://sematext.com/about/contact.html](http://sematext.com/about/contact.html) + [http://www.sematext.com/] +
Frank Scholten frank.scholten@orange11.nl Mahout [http://blog.orange11.nl/author/frank/](http://blog.orange11.nl/author/frank/) +
Winterwell daniel@winterwell.com business/maths concept development & +algorithms [http://winterwell.com](http://winterwell.com) +
Jagdish Nomula nomulaj@gmail.com ML, Search, Algorithms, Java [http://www.kosmex.com](http://www.kosmex.com) +
+ + +## Training and courses + + + +
Name contact details notes
+ + + + +## Talks and presentations + + + + + + + +
Name contact details notes
Isabel Drost Mail: isabel@apache.org If travel and accommodation +costs are covered scheduling a talk is a lot easier.
Frank Scholten frank@jteam.nl Mahout/Taste [http://blog.jteam.nl/author/frank/](http://blog.jteam.nl/author/frank/) +
+ +If you are looking for local Apache people please also consider having a +look at the [ASF Nearby Mentor Search ](http://community.zones.apache.org/) +. Added: mahout/site/mahout_cms/content/general/reference-reading.mdtext URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/general/reference-reading.mdtext?rev=1538467&view=auto ============================================================================== --- mahout/site/mahout_cms/content/general/reference-reading.mdtext (added) +++ mahout/site/mahout_cms/content/general/reference-reading.mdtext Sun Nov 3 21:36:23 2013 @@ -0,0 +1,216 @@ +Title: Reference Reading + +# General Clustering + + +## Discussions + +* [clustering tips and tricks ](http://www.lucidimagination.com/search/document/1c3561d17fc1b81c/clustering_techniques_tips_and_tricks) + + +# Text Clustering + + +## Clustering as part of Search + +* See Chapters on Hierarchical and Flat Clustering as part of search in the [stanford ir book](http://nlp.stanford.edu/IR-book/html/htmledition/irbook.html) + + +# General Background Materials + +[Q:](http://mail-archives.apache.org/mod_mbox/mahout-user/201103.mbox/%3CAANLkTi=c8kGHjvbfTCt0GyjKvvMCq=EaFsCr=_s=ykDE@mail.gmail.com%3E) +Can someone recommend me good books on Statistics and also on Linear +Algebra +and Analytic Geometry which will provide enough background for +understanding +machine learning algorithms? + + +## + +The answers below focus on general background knowledge, rather than +specifics of Mahout and associated Apache tooling. Feel free to add useful +resources (books, but also videos, online courseware, tools), particularly +those that are available free online. + +This page originated in an email thread, and its different contributors +might not all agree on the best approach (and they might not know what's +best for any given learner), but the resources here should give some idea +of suitable background reading. Check the mailing list [archives](http://mail-archives.apache.org/mod_mbox/mahout-user/) + if you care to figure out who-said-what, or find other suggestions. + +Don't be overwhelmed by all the maths, you can do a lot in Mahout with some +basic knowledge. The resources given here will help you understand your +data better, and ask better questions both of Mahout's APIs, and also of +the Mahout community. And unlike learning some particular software tool, +these are skills that will remain useful decades later. + +h3. Books and supporting materials on statistics, machine learning, +matrices etc.: + +[Gilbert Strang](http://www-math.mit.edu/~gs) +'s [Introduction to Linear Algebra|http://math.mit.edu/linearalgebra/] + (*full text* online, highly recommended by several on the mahout list). +([openlibrary](http://openlibrary.org/works/OL3285486W/Introduction_to_linear_algebra) +) +His lectures are also [available online](http://web.mit.edu/18.06/www/) + and are strongly recommended. See [http://ocw.mit.edu/courses/mathematics/18-06-linear-algebra-spring-2010/] + + +"Mathematical Tools for Applied Mulitvariate Analysis" by J.Douglass +Carroll. +([amazon](http://www.amazon.com/Mathematical-Tools-Applied-Multivariate-Analysis/dp/0121609553/ref=sr_1_1?ie=UTF8&qid=1299602805&sr=8-1) +) + + + +[Stanford Machine Learning online courseware](http://www.stanford.edu/class/cs229/) +(cs229.stanford.edu): + +"It's a very nicely taught course with super helpful lecture notes - and +you can get all the videos in youtube or [iTunesU](http://itunes.apple.com/itunes-u/machine-learning/id384233048) +" + +"The [section notes](http://www.stanford.edu/class/cs229/materials.html) + for this course will give you enough review material on linear algebra and +probability theory to get you going." + +[MIT Machine Learning online courseware](http://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-867-machine-learning-fall-2006/) + (6.867) has [Lecture notes in PDF|http://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-867-machine-learning-fall-2006/lecture-notes/] + online. + +As a pre-requisite to probability and statistics, you'll need [basic calculus](http://en.wikipedia.org/wiki/Calculus) +. A maths for scientists text might be useful here such as 'Mathematics +for Engineers and Scientists', Alan Jeffrey, Chapman & Hall/CRC. +([openlibrary](http://openlibrary.org/books/OL3305993M/Mathematics_for_engineers_and_scientists) +) + + +One of the best writers in the probability/statistics world is Sheldon +Ross. Try +''A First Course in Probability (8th Edition), Pearson'' ([amazon](http://www.pearsonhighered.com/educator/product/First-Course-in-Probability-A/9780136033134.page) +) and then move on to his ''Introduction to Probability Models (9th +Edition), Academic +Press.''([amazon|http://www.amazon.com/Introduction-Probability-Models-Sixth-Sheldon/dp/0125984707]) + + +Some good introductory alternatives here are: + +[Kahn Academy](http://www.khanacademy.org/) + -- videos on stats, probability, linear algebra + +Probability and Statistics (7th Edition), Jay L. Devore, Chapman. +([amazon](http://www.amazon.com/Probability-Statistics-Engineering-Sciences-InfoTrac/dp/0534399339) +) + +Probability and Statistical Inference (7th Edition), Hogg and Tanis, +Pearson. +([amazon](http://www.amazon.com/Probability-Statistical-Inference-Robert-Hogg/dp/0132546086) +) + +Once you have a grasp of the basics then there are a slew of great texts +that you might consult: for example, + +Statistical Inference, Casell and Berger, Duxbury/Thomson Learning. +([amazon](http://www.amazon.com/Statistical-Inference-George-Casella/dp/0534243126) +) + +Most statistics books will have some sort of introduction to Bayesian +methods, consider a specialist text, e.g.: + +Introduction to Bayesian Statistics (2nd Edition), William H. Bolstad, +Wiley. +([amazon](http://www.amazon.com/Introduction-Bayesian-Statistics-William-Bolstad/dp/0471270202) +) + +Then for the computational side of Bayesian (predominantly Markov chain +Monte Carlo), e.g. +Bolstad's Understanding Computational Bayesian Statistics, Wiley. +([amazon](http://www.amazon.com/Understanding-Computational-Bayesian-Statistics-Wiley/dp/0470046090) +) + +Then you might try [Bayesian Data Analysis, Gelman et al., Chapman &Hall/CRC](http://www.stat.columbia.edu/~gelman/book/) + +On top of the books, [R](http://en.wikipedia.org/wiki/R_(programming_language)) + \- is an indispensable software tool for visualizing distributions and +doing calculations + + + +(another viewpoint) + +For statistics related to machine learning, I would avoid normal +statistical texts and go with these instead + +[Pattern Recognition and Machine Learning by Chris Bishop](http://research.microsoft.com/en-us/um/people/cmbishop/PRML/index.htm) + +[Elements of Statistical Learning](http://www-stat.stanford.edu/~tibs/ElemStatLearn/) + by Trevor Hastie, Robert Tibshirani, Jerome Friedman + +Also [http://research.microsoft.com/en-us/um/people/cmbishop/PRML/index.htm](http://research.microsoft.com/en-us/um/people/cmbishop/PRML/index.htm) + +matrix computations/decomposition/factorization etc.? + +[How's this one?](http://www.amazon.com/gp/product/0801854148/ref=s9_simh_gw_p14_d0_i1?pf_rd_m=ATVPDKIKX0DER&pf_rd_s=center-3&pf_rd_r=0ESQ3KDY8MJ1AWWG8PFR&pf_rd_t=101&pf_rd_p=470938811&pf_rd_i=507846) +any idea? any other suggestion? + +I found the one by Peter V. O'Neil "Introduction to Linear Algebra", to be +a great book for beginners +(with some knowledge in calculus). It is not comprehensive, but, I believe, +it will be a good place to start and the author starts by explaining the +concepts with regards to vector spaces which I found to be a more natural +way of explaining.[http://www.amazon.com/Introduction-Linear-Algebra-Theory-Applications/dp/053400606X](http://www.amazon.com/Introduction-Linear-Algebra-Theory-Applications/dp/053400606X) + +David S. Watkins "Fundamentals of Matrix Computations (Pure and Applied +Mathematics: A Wiley Series of Texts, Monographs and Tracts)" +[http://www.amazon.com/Fundamentals-Matrix-Computations-Applied-Mathematics/dp/0470528338/](http://www.amazon.com/Fundamentals-Matrix-Computations-Applied-Mathematics/dp/0470528338/) + + + +The Gollub / Van Loan text you mention is the classic text for numerical +linear algebra. Can't go wrong with it. However, I'd also suggest you +look +at Nick Trefethen's "Numerical Linear Algebra". It's a bit more +approachable for practitioners -- GVL is better suited for researchers. +[http://people.maths.ox.ac.uk/trefethen/books.html](http://people.maths.ox.ac.uk/trefethen/books.html) +[http://people.maths.ox.ac.uk/trefethen/text.html](http://people.maths.ox.ac.uk/trefethen/text.html) + (with some online lecture notes) + + +I think this is the most relevant book for matrix math on distributed +systems: + +[http://www.amazon.com/Numerical-Linear-Algebra-Lloyd-Trefethen/dp/0898713617](http://www.amazon.com/Numerical-Linear-Algebra-Lloyd-Trefethen/dp/0898713617) +Many chapters on SVD, there are even chapters on Lanczos + + +BTW what about R? There is literally tons of books in R series devoted +to rather isolated problems but what would be a good crush course +book? + + +Ted Dunning: + +"I have found that learning about R is a difficult thing. The best +introduction I have seen is, paradoxically, not really a book about R and +assumes a statistical mind-set that I disagree with. That introduction is +in MASS [http://www.stats.ox.ac.uk/pub/MASS4/](http://www.stats.ox.ac.uk/pub/MASS4/) +. Other references also +exist: + +[http://www.r-tutor.com/r-introduction](http://www.r-tutor.com/r-introduction) +[http://cran.r-project.org/doc/manuals/R-intro.pdf](http://cran.r-project.org/doc/manuals/R-intro.pdf) +[http://faculty.washington.edu/tlumley/Rcourse/](http://faculty.washington.edu/tlumley/Rcourse/) + +In addition, you should see how to plot data well: + +[http://www.statmethods.net/advgraphs/trellis.html](http://www.statmethods.net/advgraphs/trellis.html) +[http://had.co.nz/ggplot2/](http://had.co.nz/ggplot2/) + +Generally, I learn more about R by watching people and reading code than by +reading books. There are many small tricks like how to format data +optimally, how to restructure data.frames, common ways to plot data, which +libraries do what and so on that an introductory book cannot convey general +principles that will see you through to success." + +For Javascript/Web plotting: [http://www.1stwebdesigner.com/css/top-jquery-chart-libraries-interactive-charts/](http://www.1stwebdesigner.com/css/top-jquery-chart-libraries-interactive-charts/) Added: mahout/site/mahout_cms/content/general/who-we-are.mdtext URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/general/who-we-are.mdtext?rev=1538467&view=auto ============================================================================== --- mahout/site/mahout_cms/content/general/who-we-are.mdtext (added) +++ mahout/site/mahout_cms/content/general/who-we-are.mdtext Sun Nov 3 21:36:23 2013 @@ -0,0 +1,43 @@ +Title: Who We Are + +# Who we are + +Apache Mahout is maintained by a team of volunteer developers. + + +## Core Committers + +Name | Mail | Webpage | PMC | Comment +----|---------|------|------|---------- +Grant Ingersoll | gsingers@... | LucidWorks | Yes (Chair) | Twitter: @gsingers +Isabel Drost-Fromm | isabel@... | Homepage, Blog | Yes | Passion for free software (development, but to some extend also the political and economic implications), interested in agile development and project management, lives in Germany. Follow me on Twitter @MaineC +Ted Dunning | tdunning@... | MapR Technologies | Yes | +Jeff Eastman | jeastman@... | WindwardSolutions | No (Emeritus) | +Drew Farris | drew@... | | Yes | | +Sean Owen | srowen@... | Myrrix | No (Emeritus) | +Dmitriy Lyubimov | dlyubimov@... | LinkedIn profile | Yes | Twitter: @dlieuOfTwit +Paritosh Ranjan | pranjan@... | LinkedIn profile | Yes | Twitter: @paritoshranjan +Suneel Marthi | smarthi@... | VeriSign Inc. | Yes | Twitter: @suneelmarthi +Stevo Slavić| sslavic@... | LinkedIn Profile | No | Twitter: @sslavic +Gokhan Capan | gcapan@... | Dilisim | No | LinkedIn Profile +Ellen Friedman | ellenf@... | | No | Twitter: @Ellen_Friedman + + + +## Emeritus Committers + +* Niranjan Balasubramanian (nbalasub@...) +* Otis Gospodnetic (otis@...) +* David Hall (dlwh@...) +* Erik Hatcher (ehatcher@...) +* Ozgur Yilmazel (oyilmazel@...) +* Dawid Weiss (dweiss@...) +* Karl Wettin (kalle@...) +* AbdelHakim Deneche (adeneche@...) + +Note that the email addresses above end with @apache.org. + + +## Contributors + +Apache Mahout contributors and their contributions to individual issues can be found at Apache JIRA. Added: mahout/site/mahout_cms/content/images/Mahout-logo-164x200.png URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/images/Mahout-logo-164x200.png?rev=1538467&view=auto ============================================================================== Binary file - no diff available. Propchange: mahout/site/mahout_cms/content/images/Mahout-logo-164x200.png ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream Added: mahout/site/mahout_cms/content/images/Mahout-logo-245x300.png URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/images/Mahout-logo-245x300.png?rev=1538467&view=auto ============================================================================== Binary file - no diff available. Propchange: mahout/site/mahout_cms/content/images/Mahout-logo-245x300.png ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream Added: mahout/site/mahout_cms/content/images/Mahout-logo-327x400.png URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/images/Mahout-logo-327x400.png?rev=1538467&view=auto ============================================================================== Binary file - no diff available. Propchange: mahout/site/mahout_cms/content/images/Mahout-logo-327x400.png ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream Added: mahout/site/mahout_cms/content/images/Mahout-logo-82x100.png URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/images/Mahout-logo-82x100.png?rev=1538467&view=auto ============================================================================== Binary file - no diff available. Propchange: mahout/site/mahout_cms/content/images/Mahout-logo-82x100.png ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream Added: mahout/site/mahout_cms/content/images/Mahout-logo-transparent.svg URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/images/Mahout-logo-transparent.svg?rev=1538467&view=auto ============================================================================== --- mahout/site/mahout_cms/content/images/Mahout-logo-transparent.svg (added) +++ mahout/site/mahout_cms/content/images/Mahout-logo-transparent.svg Sun Nov 3 21:36:23 2013 @@ -0,0 +1,181 @@ + + + + + + + image/svg+xml + + Mahout + + + Lukas Vlcek + + + + Mahout project logo + 2008-05-24 + + + + + + + + + + + + + + + + + + + + + + + + + + Added: mahout/site/mahout_cms/content/images/backward.gif URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/images/backward.gif?rev=1538467&view=auto ============================================================================== Binary file - no diff available. Propchange: mahout/site/mahout_cms/content/images/backward.gif ------------------------------------------------------------------------------ svn:executable = * Propchange: mahout/site/mahout_cms/content/images/backward.gif ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream Added: mahout/site/mahout_cms/content/images/bg.png URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/images/bg.png?rev=1538467&view=auto ============================================================================== Binary file - no diff available. Propchange: mahout/site/mahout_cms/content/images/bg.png ------------------------------------------------------------------------------ svn:executable = * Propchange: mahout/site/mahout_cms/content/images/bg.png ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream Added: mahout/site/mahout_cms/content/images/bullet.gif URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/images/bullet.gif?rev=1538467&view=auto ============================================================================== Binary file - no diff available. Propchange: mahout/site/mahout_cms/content/images/bullet.gif ------------------------------------------------------------------------------ svn:executable = * Propchange: mahout/site/mahout_cms/content/images/bullet.gif ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream Added: mahout/site/mahout_cms/content/images/download-original.png URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/images/download-original.png?rev=1538467&view=auto ============================================================================== Binary file - no diff available. Propchange: mahout/site/mahout_cms/content/images/download-original.png ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream Added: mahout/site/mahout_cms/content/images/download.png URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/content/images/download.png?rev=1538467&view=auto ============================================================================== Binary file - no diff available. Propchange: mahout/site/mahout_cms/content/images/download.png ------------------------------------------------------------------------------ svn:mime-type = application/octet-stream