mahout-user mailing list archives

From Yang Sun
Subject java heap space exception using org.apache.mahout.fpm.pfpgrowth.FPGrowthDriver
Date Mon, 07 Feb 2011 19:55:09 GMT

I'm trying to run parallel FPGrowth on a text-based data set of
about 14K documents, but when I run Mahout I get the following error:

FATAL org.apache.hadoop.mapred.TaskTracker: Error running child :
java.lang.OutOfMemoryError: Java heap space
	at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.<init>(
	at org.apache.hadoop.mapred.MapTask$NewOutputCollector.<init>(
	at org.apache.hadoop.mapred.MapTask.runNewMapper(
	at org.apache.hadoop.mapred.Child.main(

The command:

hadoop jar mahout-examples-0.5-SNAPSHOT-job.jar \
  org.apache.mahout.fpm.pfpgrowth.FPGrowthDriver \
  -i tital_tokens -o patterns -k 3000 -method mapreduce \
  -g 10 -regex '[\ ]' -s 10

Can someone tell me how to fix this? And is it possible to use this
algorithm on a text-based data set at all?


