giraph-user mailing list archives

From Alessandro Presta <alessan...@fb.com>
Subject Re: Input formats
Date Fri, 22 Mar 2013 18:36:02 GMT
The first layout looks like what IntNullTextEdgeInputFormat reads. For the second one you can
write something similar to AdjacencyListTextVertexInputFormat, except that there are no edge values.
The input does not have to be split across several files. Splitting is desirable if you have
a big dataset, but HDFS should handle that.
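
To make the two layouts concrete, here is a minimal sketch in plain Java of how a line of each kind could be tokenized. It does not use the Giraph API: the class InputLayoutSketch and its methods are hypothetical names made up for illustration, and the separator (one or more tabs or spaces) is an assumption based on the sample lines in the question.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper class for illustration; not part of Giraph.
public class InputLayoutSketch {
    // Assumption: fields are separated by one or more tabs or spaces.
    private static final Pattern SEPARATOR = Pattern.compile("[\\t ]+");

    // Edge list line "1  3" -> {1, 3} (source id, target id), no edge value.
    public static int[] parseEdge(String line) {
        String[] tokens = SEPARATOR.split(line.trim());
        if (tokens.length != 2) {
            throw new IllegalArgumentException("Expected 2 fields: " + line);
        }
        return new int[] {Integer.parseInt(tokens[0]), Integer.parseInt(tokens[1])};
    }

    // Adjacency line "3   1   2   5" -> [3, 1, 2, 5]:
    // the first element is the vertex id, the rest are its neighbours.
    public static List<Integer> parseAdjacency(String line) {
        String[] tokens = SEPARATOR.split(line.trim());
        List<Integer> ids = new ArrayList<>();
        for (String token : tokens) {
            ids.add(Integer.parseInt(token));
        }
        return ids;
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(parseEdge("1  3")));  // prints [1, 3]
        System.out.println(parseAdjacency("3   1   2   5"));     // prints [3, 1, 2, 5]
    }
}
```

In a real job this tokenizing would sit inside the input format's line reader, which would build vertices and edges instead of returning arrays and lists.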

From: Rui Sarmento <rui_sarmento@hotmail.com>
Reply-To: "user@giraph.apache.org" <user@giraph.apache.org>
Date: Friday, March 22, 2013 11:29 AM
To: Giraph Support <user@giraph.apache.org>
Subject: Input formats

Hi all,

Which input formats should I use for edge lists and adjacency lists in the following layouts?

For Edge List (undirected):

1  3
3  2
4  8
5  3
.   .
.   .
.   .

and for adjacency lists like:

1   2
2   3
3   1   2   5
4   8
5   3
.    .
.    .
.    .

Another question: is it a requirement to have the network divided into several part files in
HDFS?

Thanks very much for your help.

Regards
