mahout-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ted2Dunning (Confluence)" <conflue...@apache.org>
Subject [CONF] Apache Mahout > Release 0.8
Date Mon, 08 Jul 2013 06:06:00 GMT
<html>
<head>
    <base href="https://cwiki.apache.org/confluence">
            <link rel="stylesheet" href="/confluence/s/en/2176/1/186/_/styles/combined.css?spaceKey=MAHOUT&amp;forWysiwyg=true"
type="text/css">
    </head>
<body style="background: white;" bgcolor="white" class="email-body">
<div id="pageContent">
<div id="notificationFormat">
<div class="wiki-content">
<div class="email">
    <h2><a href="https://cwiki.apache.org/confluence/display/MAHOUT/Release+0.8">Release
0.8</a></h2>
    <h4>Page <b>edited</b> by             <a href="https://cwiki.apache.org/confluence/display/~ted.dunning@gmail.com">Ted2Dunning</a>
    </h4>
        <div id="versionComment">
        <b>Comment:</b>
        Added streaming k-means mention<br />
    </div>
        <br/>
                         <h4>Changes (4)</h4>
                                 
    
<div id="page-diffs">
                    <table class="diff" cellpadding="0" cellspacing="0">
    
            <tr><td class="diff-snipped" >...<br></td></tr>
            <tr><td class="diff-unchanged" >The highlights of the Apache Mahout
0.8 release include, but are not limited to the list below.  For further information, see
the included CHANGELOG file. <br> <br></td></tr>
            <tr><td class="diff-changed-lines" >- Numerous performance improvements
to Vector and Matrix <span class="diff-changed-words">implementations<span class="diff-added-chars"style="background-color:
#dfd;">, API&#39;s</span></span> and their iterators <br></td></tr>
            <tr><td class="diff-unchanged" >- MAHOUT-944:  Support for converting
one or more Lucene storage indexes to SequenceFiles. <br></td></tr>
            <tr><td class="diff-deleted-lines" style="color:#999;background-color:#fdd;text-decoration:line-through;">-
 FILL IN HERE <br></td></tr>
            <tr><td class="diff-added-lines" style="background-color: #dfd;">-
MAHOUT-1154 and friends: New streaming k-means implementation that offers on-line (and fast)
clustering <br></td></tr>
            <tr><td class="diff-unchanged" >- The usual bug fixes.  See JIRA for
more information on the 0.8 release. <br> <br></td></tr>
            <tr><td class="diff-added-lines" style="background-color: #dfd;">A
total of 174 separate JIRA issues are addressed in this release. <br></td></tr>
            <tr><td class="diff-unchanged" > <br>FUTURE PLANS <br></td></tr>
            <tr><td class="diff-snipped" >...<br></td></tr>
    
            </table>
    </div>                            <h4>Full Content</h4>
                    <div class="notificationGreySide">
        <p>DRAFT RELEASE NOTES FOR 0.8</p>
<blockquote>
<p>The Apache Mahout PMC is pleased to announce the release of Mahout 0.8.  Mahout's
goal is to build scalable machine learning libraries focused primarily in the areas of collaborative
filtering (recommenders), clustering and classification (known as the "3Cs"), as well as the
necessary infrastructure to support those implementations including, but not limited to, math
packages for statistics, linear algebra and others as well as Java primitive collections,
local and distributed vector and matrix classes and a variety of integrative code to work
with popular packages like Apache Lucene, Apache HBase, Apache Cassandra and much more.</p>

<p>To get started with Apache Mahout 0.8, download the release artifacts and signatures
at FILL IN HERE.  The examples directory contains several working examples of the core functionality
available in Mahout.  These can be run via scripts in the examples/bin directory.  Most examples
do not need a Hadoop cluster in order to run.</p>

<p>Please pay attention to the section labelled FUTURE PLANS below for more information
about upcoming releases of Mahout.</p>

<p>As with any release, we wish to thank all of the users and contributors to Mahout.
 Please see the CHANGELOG and JIRA for individual credits, as there are too many to list here.</p>

<p>CONTRIBUTING</p>

<p>Mahout is always looking for contributions focused on the 3Cs.  If you are interested
in contributing, please see our <a href="https://cwiki.apache.org/MAHOUT/how-to-contribute.html"
class="external-link" rel="nofollow">https://cwiki.apache.org/MAHOUT/how-to-contribute.html</a>
on the Mahout wiki.</p>

<p>RELEASE HIGHLIGHTS</p>

<p>The highlights of the Apache Mahout 0.8 release include, but are not limited to the
list below.  For further information, see the included CHANGELOG file.</p>

<ul class="alternate" type="square">
	<li>Numerous performance improvements to Vector and Matrix implementations, API's and
their iterators</li>
	<li>MAHOUT-944:  Support for converting one or more Lucene storage indexes to SequenceFiles.</li>
	<li>MAHOUT-1154 and friends: New streaming k-means implementation that offers on-line
(and fast) clustering</li>
	<li>The usual bug fixes.  See JIRA for more information on the 0.8 release.</li>
</ul>


<p>A total of 174 separate JIRA issues are addressed in this release.</p>

<p>FUTURE PLANS</p>

<p>0.9</p>

<p>As the project moves towards a 1.0 release, the community is working to clean up
and/or remove parts of the code base that are under-supported or that underperform as well
as to better focus the energy and contributions on key algorithms that are proven to scale
in production and have seen wide-spread adoption.  To this end, in the next release, the project
is planning on removing support for the following algorithms unless there is sustained support
and improvement of them before the next release.</p>

<p>The algorithms to be removed are:</p>
<ul class="alternate" type="square">
	<li>From Clustering:<br/>
	Dirichlet<br/>
	MeanShift<br/>
	MinHash</li>
	<li>From Classification (both are sequential implementations)<br/>
	Winnow<br/>
	Perceptron</li>
	<li>Frequent Pattern Mining</li>
	<li>Collaborative Filtering<br/>
	GSI: DO ANY GO HERE?</li>
</ul>


<p>If you are interested in supporting 1 or more of these algorithms, please make it
known on dev@mahout.apache.org and via JIRA issues that fix and/or improve them.  Please also
provide supporting evidence as to there effectiveness for you in production.</p>

<p>1.0 PLANS</p>

<p>Our plans as a community are to focus 0.9 on cleanup of bugs and the removal of the
code mentioned above and then to follow with a 1.0 release soon thereafter, at which point
the community is committing to the support of the algorithms packaged in the 1.0 for at least
two minor versions after their release.  In the case of removal, we will deprecate the functionality
in the 1.(x+1) minor release and remove it in the 1.(x+2) release.  For instance, if feature
X is to be removed after the 1.2 release, it will be deprecated in 1.3 and removed in 1.4.</p></blockquote>
    </div>
        <div id="commentsSection" class="wiki-content pageSection">
        <div style="float: right;" class="grey">
                        <a href="https://cwiki.apache.org/confluence/users/removespacenotification.action?spaceKey=MAHOUT">Stop
watching space</a>
            <span style="padding: 0px 5px;">|</span>
                <a href="https://cwiki.apache.org/confluence/users/editmyemailsettings.action">Change
email notification preferences</a>
</div>
        <a href="https://cwiki.apache.org/confluence/display/MAHOUT/Release+0.8">View
Online</a>
        |
        <a href="https://cwiki.apache.org/confluence/pages/diffpagesbyversion.action?pageId=31824295&revisedVersion=2&originalVersion=1">View
Changes</a>
                |
        <a href="https://cwiki.apache.org/confluence/display/MAHOUT/Release+0.8?showComments=true&amp;showCommentArea=true#addcomment">Add
Comment</a>
            </div>
</div>
</div>
</div>
</div>
</body>
</html>

Mime
View raw message