mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "cumtyjh" <cumt...@163.com>
Subject got Error: GC overhead limit exceeded when generate product similariy
Date Mon, 09 Nov 2009 02:28:02 GMT
hi,all

i got some error when generate product similarity according to rating file, and there is about
250,000 recordes in rating file.

it works when there is only 10,000 recordes in rating file.


do you have some suggestion? any help is appreciated

thanks in advance.


following is code and log:


 File file = new File(ratingFile);
 logger.log(Level.INFO, "begin to load rating file...");
 FileDataModel model = new FileDataModel(file);
 logger.log(Level.INFO, "load rating file OK.");
ItemSimilarity pearson = new LogLikelihoodSimilarity(model);
GenericItemSimilarity gif = new GenericItemSimilarity(pearson,model);



INFO: load rating file OK.
- Reading file info...
- Processed 100000 lines
- Processed 200000 lines
Exception in thread "Thread-9" java.lang.OutOfMemoryError: GC overhead limit exceeded
at org.apache.mahout.cf.taste.impl.common.FastSet.<init>(FastSet.java:74)
at org.apache.mahout.cf.taste.impl.model.GenericDataModel.getNumUsersWithPreferenceFor(GenericDataModel.java:195)
at org.apache.mahout.cf.taste.impl.model.file.FileDataModel.getNumUsersWithPreferenceFor(FileDataModel.java:314)
at org.apache.mahout.cf.taste.impl.similarity.LogLikelihoodSimilarity.itemSimilarity(LogLikelihoodSimilarity.java:48)
at org.apache.mahout.cf.taste.impl.similarity.GenericItemSimilarity$DataModelSimilaritiesIterator.next(GenericItemSimilarity.java:291)
at org.apache.mahout.cf.taste.impl.similarity.GenericItemSimilarity$DataModelSimilaritiesIterator.next(GenericItemSimilarity.java:260)
at org.apache.mahout.cf.taste.impl.similarity.GenericItemSimilarity.initSimilarityMaps(GenericItemSimilarity.java:128)
at org.apache.mahout.cf.taste.impl.similarity.GenericItemSimilarity.<init>(GenericItemSimilarity.java:103)

2009-11-09 



cumtyjh 

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message