Unfortunately, we dropped the support for PageRank. For performance
reasons, our implementation assumed that the pageRank vector fits into
memory, making it unsuitable for very large graphs.
I'd recommend you have a look at Apache Giraph, a framework dedicated to
large scale graph processing.
On 03.08.2012 10:27, Yan Liu (JIRA) wrote:
> Yan Liu created MAHOUT-1049:
> -------------------------------
>
> Summary: out of memory error when running PageRank
> Key: MAHOUT-1049
> URL: https://issues.apache.org/jira/browse/MAHOUT-1049
> Project: Mahout
> Issue Type: Improvement
> Reporter: Yan Liu
>
>
> We always met a 'out of memory' error when running PageRank. Since we have to run large-scale
data, is there any way for improvement?
>
> --
> This message is automatically generated by JIRA.
> If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
> For more information on JIRA, see: http://www.atlassian.com/software/jira
>
>
>
|