giraph-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Gianmarco De Francisci Morales (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (GIRAPH-235) SequenceFile output format (id-value only)
Date Wed, 04 Jul 2012 16:08:33 GMT

     [ https://issues.apache.org/jira/browse/GIRAPH-235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Gianmarco De Francisci Morales updated GIRAPH-235:
--------------------------------------------------

    Attachment: GIRAPH-235.1.patch

I have an initial patch for this, but I encountered some problems which I was not able to
figure out. Namely, I get this exception when I try to use this class with GiraphRunner:

{code}
java.lang.IllegalStateException: run: Caught an unrecoverable exception wrong value class:
org.apache.hadoop.io.DoubleWritable is not class org.apache.hadoop.io.Text
	at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:666)
	at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
	at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:396)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1093)
	at org.apache.hadoop.mapred.Child.main(Child.java:249)
Caused by: java.io.IOException: wrong value class: org.apache.hadoop.io.DoubleWritable is
not class org.apache.hadoop.io.Text
	at org.apache.hadoop.io.SequenceFile$Writer.append(SequenceFile.java:1046)
	at org.apache.hadoop.mapreduce.lib.output.SequenceFileOutputFormat$1.write(SequenceFileOutputFormat.java:74)
	at org.apache.giraph.examples.VertexIDValueBinOutputFormat$VertexIdValueWriter.writeVertex(VertexIDValueBinOutputFormat.java:40)
	at org.apache.giraph.graph.BspServiceWorker.saveVertices(BspServiceWorker.java:1084)
	at org.apache.giraph.graph.BspServiceWorker.cleanup(BspServiceWorker.java:1094)
	at org.apache.giraph.graph.GraphMapper.cleanup(GraphMapper.java:634)
	at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:661)
{code}


My vertex is a RandomWalkWithRestartVertex which extends (via 2 indirect levels of inheritance)
MutableVertex<LongWritable, DoubleWritable, FloatWritable, DoubleWritable>

I guess I am misconfiguring something.
                
> SequenceFile output format (id-value only)
> ------------------------------------------
>
>                 Key: GIRAPH-235
>                 URL: https://issues.apache.org/jira/browse/GIRAPH-235
>             Project: Giraph
>          Issue Type: New Feature
>          Components: lib
>            Reporter: Gianmarco De Francisci Morales
>         Attachments: GIRAPH-235.1.patch
>
>
> Create a SequenceFileOutputFormat for the cases where compression is important and we
only want the value of the vertex (e.g. pagerank)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message