mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Yang Sun <soushare....@gmail.com>
Subject java heap space exception using org.apache.mahout.fpm.pfpgrowth.FPGrowthDriver
Date Mon, 07 Feb 2011 19:55:09 GMT
Hi,

I'm trying to use parallel FPGrowth on a text based data set with
about 14K documents. But when I run mahout, I got the following
exception:

FATAL org.apache.hadoop.mapred.TaskTracker: Error running child :
java.lang.OutOfMemoryError: Java heap space
	at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.<init>(MapTask.java:781)
	at org.apache.hadoop.mapred.MapTask$NewOutputCollector.<init>(MapTask.java:524)
	at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:613)
	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
	at org.apache.hadoop.mapred.Child.main(Child.java:170)


The command:

hadoop jar mahout-examples-0.5-SNAPSHOT-job.jar
org.apache.mahout.fpm.pfpgrowth.FPGrowthDriver -i tital_tokens -o
patterns -k 3000 -method mapreduce -g 10 -regex '[\ ]' -s 10


Can someone tell me how I can fix this? Or is it possible to use the
algorithm for a text based dataset?


Thanks

Yang

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message