Return-Path: Delivered-To: apmail-lucene-mahout-user-archive@minotaur.apache.org Received: (qmail 65240 invoked from network); 30 Jan 2010 02:02:20 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 30 Jan 2010 02:02:20 -0000 Received: (qmail 47788 invoked by uid 500); 30 Jan 2010 02:02:20 -0000 Delivered-To: apmail-lucene-mahout-user-archive@lucene.apache.org Received: (qmail 47731 invoked by uid 500); 30 Jan 2010 02:02:19 -0000 Mailing-List: contact mahout-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mahout-user@lucene.apache.org Delivered-To: mailing list mahout-user@lucene.apache.org Received: (qmail 47721 invoked by uid 99); 30 Jan 2010 02:02:19 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 30 Jan 2010 02:02:19 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of viniciusccarvalho@gmail.com designates 209.85.216.186 as permitted sender) Received: from [209.85.216.186] (HELO mail-px0-f186.google.com) (209.85.216.186) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 30 Jan 2010 02:02:10 +0000 Received: by pxi16 with SMTP id 16so2541591pxi.29 for ; Fri, 29 Jan 2010 18:01:49 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:from:date:message-id :subject:to:content-type; bh=BBbOMk8sJt+d4nCW+YpYvhAilvqCoEqRw5wCaXVlWi4=; b=hQC3sTzxVPmutzOjAOFRgNef554vdoLrcjVBP4ldrHR7VwtDANfRHQCbm6ZRoO8vUJ 2WX50mLXN/WrsJtFdzeIAyd8rgzvYU4GnbWsdrm+qW9NH36CMjn5C0gvi94ZvZ3JrmG1 lvO+LbCepvYVT1bHJ7E7wqmyq7kzUnYzPHTCg= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:from:date:message-id:subject:to:content-type; b=dFGqyQ0kYmw4cqfX5TlM8yr97644vBAjalrCh61Dvo7XKZmd6hEgqRzoxA58Km0zlO 3n2vU7p2/nXt51RSZHl5zEK26SnnKmq/EZKT6j7sgQVReHV8sf6ceWX/M4GY7pJX4+Fa m9QSdlggwhZ2+cbKhJ4OdPROexSK0w7lufVQ0= MIME-Version: 1.0 Received: by 10.142.152.30 with SMTP id z30mr1075775wfd.293.1264816909248; Fri, 29 Jan 2010 18:01:49 -0800 (PST) From: Vinicius Carvalho Date: Sat, 30 Jan 2010 00:01:29 -0200 Message-ID: <9d2777b61001291801w66fe0b45ldd76e90bab5a28f1@mail.gmail.com> Subject: Getting Taste to work on 10M dataset To: mahout-user@lucene.apache.org Content-Type: multipart/alternative; boundary=000e0cd2da8694eb1e047e582211 X-Virus-Checked: Checked by ClamAV on apache.org --000e0cd2da8694eb1e047e582211 Content-Type: text/plain; charset=ISO-8859-1 Hello there! I'm trying to get Taste to work on the 10M dataset but, even following some tips from Sean on Mahout in Action, I can't get it working using SlopeOneRecommender and JDBC. The machine I run the examples is a Core 2 Duo 2.8 Ghz with 4GB RAM Ubuntu 9.10 64bit JDK 1.6 The Mysql is set to use up to 512MB of table cache The JVM is running with Xmx=2048mb I'm using spring to make things simpler but bottom line I create a slopeone recommender using the constructor: DataModel:JDBCDataModel weighting:Weighted weighting:Weighted diffStorage:memory The memory storage is configured: DataModel:JDBCDataModel weighting:Weighted compact:false maxEntries:100000 Running the code I get this exception: Caused by: java.lang.NullPointerException at com.mysql.jdbc.ResultSetImpl.getLong(ResultSetImpl.java:2843) at com.mysql.jdbc.ResultSetImpl.getLong(ResultSetImpl.java:2830) at org.apache.commons.dbcp.DelegatingResultSet.getLong(DelegatingResultSet.java:190) at org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel.getLongColumn(AbstractJDBCDataModel.java:602) at org.apache.mahout.cf.taste.impl.model.jdbc.AbstractJDBCDataModel$ResultSetIDIterator.nextLong(AbstractJDBCDataModel.java:677) at org.apache.mahout.cf.taste.impl.recommender.slopeone.MemoryDiffStorage.buildAverageDiffs(MemoryDiffStorage.java:221) at org.apache.mahout.cf.taste.impl.recommender.slopeone.MemoryDiffStorage.(MemoryDiffStorage.java:115) at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39) at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27) at java.lang.reflect.Constructor.newInstance(Constructor.java:513) at org.springframework.beans.BeanUtils.instantiateClass(BeanUtils.java:100) ... 57 more I tried to replace the MemoryDiff by a JDBCDiff. After almost 1 hour running the code, and 100% CPU usage by my mysql process. I decided to quit, is it supposed to take so long? I tried to change the maxEntries to a smaller value but the NullPointer always happens. My table is not exactly like the one used as sample on the source code, but I do inform the correct columns and they are of the same type. Any ideas? -- The intuitive mind is a sacred gift and the rational mind is a faithful servant. We have created a society that honors the servant and has forgotten the gift. --000e0cd2da8694eb1e047e582211--