mahout-commits mailing list archives

From build...@apache.org
Subject svn commit: r901786 - in /websites/staging/mahout/trunk/content: ./ users/clustering/visualizing-sample-clusters.html
Date Sun, 16 Mar 2014 08:08:58 GMT
Author: buildbot
Date: Sun Mar 16 08:08:57 2014
New Revision: 901786

Log:
Staging update by buildbot for mahout

Modified:
    websites/staging/mahout/trunk/content/   (props changed)
    websites/staging/mahout/trunk/content/users/clustering/visualizing-sample-clusters.html

Propchange: websites/staging/mahout/trunk/content/
------------------------------------------------------------------------------
--- cms:source-revision (original)
+++ cms:source-revision Sun Mar 16 08:08:57 2014
@@ -1 +1 @@
-1577980
+1577992

Modified: websites/staging/mahout/trunk/content/users/clustering/visualizing-sample-clusters.html
==============================================================================
--- websites/staging/mahout/trunk/content/users/clustering/visualizing-sample-clusters.html (original)
+++ websites/staging/mahout/trunk/content/users/clustering/visualizing-sample-clusters.html Sun Mar 16 08:08:57 2014
@@ -205,71 +205,43 @@
     <p><a name="VisualizingSampleClusters-Introduction"></a></p>
 <h1 id="introduction">Introduction</h1>
 <p>Mahout provides examples to visualize sample clusters that gets created by
-various clustering algorithms like
-<em> Canopy Clustering
-</em> Dirichlet Process
-<em> KMeans
-</em> Fuzzy KMeans
-<em> MeanShift Canopy
-</em> Spectral KMeans
-* MinHash</p>
-<p><a name="VisualizingSampleClusters-Note"></a></p>
-<h5 id="note">Note</h5>
-<p>These are Swing programs. You have to be in a window system on the same
-machine you run these, or logged in via a "remote desktop" or VNC program.</p>
+our clustering algorithms. Note that the visualization is done by Swing programs. You have to be in a window system on the same
+machine you run these, or logged in via a remote desktop.</p>
 <p><a name="VisualizingSampleClusters-Pre-Prep"></a></p>
 <h1 id="pre-prep">Pre - Prep</h1>
-<p>For visualizing the clusters, you would just have to execute the Java
-classes under org.apache.mahout.clustering.display package in
-mahout-examples module. If you are using eclipse, setup mahout-examples as
-a project as specified in <a href="buildingmahout#mahout_maven_eclipse.html">Working with Maven in Eclipse</a>
-.</p>
+<p>For visualizing the clusters, you have to execute the Java
+classes under <em>org.apache.mahout.clustering.display</em> package in
+mahout-examples module. The easiest way to achieve this is to <a href="users/basics/quickstart.html">setup Mahout</a> in your IDE.</p>
 <p><a name="VisualizingSampleClusters-Visualizingclusters"></a></p>
 <h1 id="visualizing-clusters">Visualizing clusters</h1>
-<p>The following classes in org.apache.mahout.clustering.display can be run
+<p>The following classes in <em>org.apache.mahout.clustering.display</em> can be run
 without parameters to generate a sample data set and run the reference
-clustering implementations over them:
-1. DisplayClustering - generates 1000 samples from three, symmetric
+clustering implementations over them:</p>
+<ol>
+<li><strong>DisplayClustering</strong> - generates 1000 samples from three, symmetric
 distributions. This is the same data set that is used by the following
 clustering programs. It displays the points on a screen and superimposes
 the model parameters that were used to generate the points. You can edit
-the generateSamples() method to change the sample points used by these
-programs.
-1. DisplayClustering - displays initial areas of generated points
-1. DisplayDirichlet - uses Dirichlet Process clustering
-1. DisplayCanopy - uses Canopy clustering
-1. DisplayKMeans - uses k-Means clustering
-1. DisplayFuzzyKMeans - uses Fuzzy k-Means clustering
-1. DisplayMeanShift - uses MeanShift clustering
-1. DisplaySpectralKMeans - uses Spectral KMeans via map-reduce algorithm</p>
-<p>If you are using Eclipse and have set it up as specified in Pre-Prep, just
-right-click on each of the classes mentioned above and choose "Run As -
-Java Application". To run these directly from the command line:</p>
+the <em>generateSamples()</em> method to change the sample points used by these
+programs.</li>
+<li><strong>DisplayClustering</strong> - displays initial areas of generated points</li>
+<li><strong>DisplayCanopy</strong> - uses Canopy clustering</li>
+<li><strong>DisplayKMeans</strong> - uses k-Means clustering</li>
+<li><strong>DisplayFuzzyKMeans</strong> - uses Fuzzy k-Means clustering</li>
+<li><strong>DisplaySpectralKMeans</strong> - uses Spectral KMeans via map-reduce algorithm</li>
+</ol>
+<p>If you are using Eclipse, just right-click on each of the classes mentioned above and choose "Run As -Java Application". To run these directly from the command line:</p>
 <div class="codehilite"><pre><span class="n">cd</span> $<span class="n">MAHOUT_HOME</span><span class="o">/</span><span class="n">examples</span>
-<span class="n">mvn</span> <span class="o">-</span><span class="n">q</span> <span class="n">exec</span><span class="p">:</span><span class="n">java</span>
+<span class="n">mvn</span> <span class="o">-</span><span class="n">q</span> <span class="n">exec</span><span class="p">:</span><span class="n">java</span> <span class="o">-</span><span class="n">Dexec</span><span class="p">.</span><span class="n">mainClass</span><span class="p">=</span><span class="n">org</span><span class="p">.</span><span class="n">apache</span><span class="p">.</span><span class="n">mahout</span><span class="p">.</span><span class="n">clustering</span><span class="p">.</span><span class="n">display</span><span class="p">.</span><span class="n">DisplayClustering</span>
 </pre></div>
 
 
-<p>-Dexec.mainClass=org.apache.mahout.clustering.display.DisplayClustering
-    # substitute other names above for DisplayClustering
-    # Note: the DisplaySpectralKMeans program does a Hadoop job that takes 3
-minutes on a laptop. Set this MVN_OPTS=300m to give the program enough
-memory. You may find that some of the other programs also need more memory.</p>
-<p>Note:
-* Some of these programs display the sample points and then superimpose all
-of the clusters from each iteration. The last iteration's clusters are in
+<p>You can substitute other names above for <em>DisplayClustering</em>. </p>
+<p>Note that some of these programs display the sample points and then superimpose all of the clusters from each iteration. The last iteration's clusters are in
 bold red and the previous several are colored (orange, yellow, green, blue,
 magenta) in order after which all earlier clusters are in light grey. This
 helps to visualize how the clusters converge upon a solution over multiple
 iterations.</p>
-<ul>
-<li>By changing the parameter values (k, ALPHA_0, numIterations) and the
-display SIGNIFICANCE you can obtain different results.</li>
-</ul>
-<p><a name="VisualizingSampleClusters-ScreenCaptureAnimation"></a></p>
-<h1 id="screen-capture-animation">Screen Capture Animation</h1>
-<p>See <a href="sample-clusters-animation.html">Sample Clusters Animation</a>
- for a screen caps of all the above programs, and an animated gif.</p>
    </div>
   </div>     
 </div> 
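
Reader's note on the command in the patch: stripped of its syntax-highlighting markup, the new page tells users to change into $MAHOUT_HOME/examples and run mvn -q exec:java with -Dexec.mainClass set to a class in org.apache.mahout.clustering.display. The sketch below is only an illustration of that substitution step. The helper name display_cmd is hypothetical and not part of Mahout, and it assumes MAHOUT_HOME points at a Mahout source checkout.

```shell
# Hypothetical helper (not part of Mahout): prints the Maven command line
# for one of the display classes named in the patch above.
# With no argument it falls back to DisplayClustering, as on the page.
display_cmd() {
  class_name="${1:-DisplayClustering}"
  printf 'mvn -q exec:java -Dexec.mainClass=org.apache.mahout.clustering.display.%s\n' "$class_name"
}

# Usage sketch (assumes a windowing session, since these are Swing programs,
# and that MAHOUT_HOME is set):
#   cd "$MAHOUT_HOME/examples" && $(display_cmd DisplayKMeans)
```

Printing the command instead of running it keeps the sketch safe to try on a machine without a display. Any other class from the list in the patch can be passed as the argument.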


