madlib-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From njayaram2 <...@git.apache.org>
Subject [GitHub] incubator-madlib pull request #112: Feautre: Add grouping support for PageRa...
Date Thu, 06 Apr 2017 18:52:17 GMT
GitHub user njayaram2 opened a pull request:

    https://github.com/apache/incubator-madlib/pull/112

    Feautre: Add grouping support for PageRank

    MADLIB-1082
    
    - Add grouping support for pagerank, which will compute a PageRank
    probability distribution for the graph represented by each group.
    - Add convergence test, so that PageRank computation terminates
    if the pagerank value of no node changes beyond a threshold across
    two consecutive iterations (or max_iters number of iterations are
    done, whichever happens first). In case of grouping, the algorithm
    terminates only after all groups have converged.
    - Create a summary table apart from the output table that records
    the number of iterations required for convergence. Iterations
    required for convergence of each group is recorded when grouping
    is used. This implementation also ensures that we don't compute
    PageRank for groups that have already converged.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/njayaram2/incubator-madlib feature/pagerank_grouping

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-madlib/pull/112.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #112
    
----
commit 5b95581e0dd9981086f17097e46a59376ce0b603
Author: Nandish Jayaram <njayaram@apache.org>
Date:   2017-04-01T00:03:50Z

    Feautre: Add grouping support for PageRank
    
    MADLIB-1082
    
    - Add grouping support for pagerank, which will compute a PageRank
    probability distribution for the graph represented by each group.
    - Add convergence test, so that PageRank computation terminates
    if the pagerank value of no node changes beyond a threshold across
    two consecutive iterations (or max_iters number of iterations are
    done, whichever happens first). In case of grouping, the algorithm
    terminates only after all groups have converged.
    - Create a summary table apart from the output table that records
    the number of iterations required for convergence. Iterations
    required for convergence of each group is recorded when grouping
    is used. This implementation also ensures that we don't compute
    PageRank for groups that have already converged.

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

Mime
View raw message