giraph-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From David J Garcia <djch...@utexas.edu>
Subject vertex and data block co-location
Date Sat, 16 Nov 2013 20:22:45 GMT
hello, I was wondering if there was a way to ensure that vertices located
on the same data block (on hdfs) are co-located with each other?

Also, will the vertices in input-splits (splits that are located on the
same DataNode) have a reasonable chance of being partitioned to the same id?

for example, suppose that I have vertex_1 located on data_block_i, and
vertex_2 located on data_block_k.  Let's suppose that both of the data
blocks are located on the same DataNode machine.  Is there a reasonably
good chance that the vertex_1 and vertex_2 will partition to the same id?

I'm doing a research project and I'm trying to show the benefits of graph
data-locality.

-David

Mime
View raw message