Return-Path: Delivered-To: apmail-lucene-mahout-user-archive@minotaur.apache.org Received: (qmail 81495 invoked from network); 12 Jan 2010 23:46:42 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 12 Jan 2010 23:46:42 -0000 Received: (qmail 83347 invoked by uid 500); 12 Jan 2010 23:46:41 -0000 Delivered-To: apmail-lucene-mahout-user-archive@lucene.apache.org Received: (qmail 83299 invoked by uid 500); 12 Jan 2010 23:46:41 -0000 Mailing-List: contact mahout-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mahout-user@lucene.apache.org Delivered-To: mailing list mahout-user@lucene.apache.org Received: (qmail 83289 invoked by uid 99); 12 Jan 2010 23:46:41 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Jan 2010 23:46:41 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of bogdan.vatkov@gmail.com designates 209.85.219.217 as permitted sender) Received: from [209.85.219.217] (HELO mail-ew0-f217.google.com) (209.85.219.217) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Jan 2010 23:46:34 +0000 Received: by ewy9 with SMTP id 9so6477199ewy.11 for ; Tue, 12 Jan 2010 15:46:13 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:date:message-id:subject :from:to:content-type; bh=SQBQ6GXN5EsPsJMRncpTBpNdreVG0UrvyxfbNhnEs/Y=; b=lHr4K7ROXwIKcPygvwTssAoOI9N1/Z0U60jSKzBL7Fc79Hb6vdgS1B3x1NoexJ6fHb XCTevoX70amsKMAxjwaSvKgLdlZqq9MOBeuZV11NiSBJcgLL2j6I9qXL/ZwcL9eTGvP5 uJJtryxT6FwbkdRIV/CnCA8AcYnzwWzbq2J3s= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=IT8L85MKBF7DOPvkHZFMbYMembc1gyqYDJuI8lEh4hMjVyFjuOpF1SQyuJXBWQ/oek FW1uzpqDloAO/VMH+GJnZpKZ+jP5C4MjTl0nJYfvxVRYwYJ3konoH/tsX0+elP8JrSmY FRHhtl0nneNkfCSJFIzDQkc2bDuolTIGEdIfo= MIME-Version: 1.0 Received: by 10.213.37.194 with SMTP id y2mr55495ebd.54.1263339973487; Tue, 12 Jan 2010 15:46:13 -0800 (PST) Date: Wed, 13 Jan 2010 01:46:13 +0200 Message-ID: Subject: CardinalityException in DirichletDriver From: Bogdan Vatkov To: mahout-user@lucene.apache.org Content-Type: multipart/alternative; boundary=001485318beb59b2af047d004247 --001485318beb59b2af047d004247 Content-Type: text/plain; charset=ISO-8859-1 what could be the reason for this Cardinality exception? 10/01/13 01:41:09 INFO clustering.SolrToMahoutDriver: Wrote: 174 vectors 10/01/13 01:41:09 INFO clustering.SolrToMahoutDriver: Dictionary Output file: /store/dev/inst/mahout-0.2/email-clustering/1-solr-vectors/dictionary.txt 10/01/13 01:41:11 INFO dirichlet.DirichletDriver: Iteration 0 10/01/13 01:41:11 INFO jvm.JvmMetrics: Initializing JVM Metrics with processName=JobTracker, sessionId= 10/01/13 01:41:11 WARN mapred.JobClient: Use GenericOptionsParser for parsing the arguments. Applications should implement Tool for the same. 10/01/13 01:41:11 INFO mapred.FileInputFormat: Total input paths to process : 1 10/01/13 01:41:11 INFO mapred.JobClient: Running job: job_local_0001 10/01/13 01:41:11 INFO mapred.FileInputFormat: Total input paths to process : 1 10/01/13 01:41:11 INFO compress.CodecPool: Got brand-new decompressor 10/01/13 01:41:11 INFO mapred.MapTask: numReduceTasks: 1 10/01/13 01:41:11 INFO mapred.MapTask: io.sort.mb = 100 10/01/13 01:41:12 INFO mapred.MapTask: data buffer = 79691776/99614720 10/01/13 01:41:12 INFO mapred.MapTask: record buffer = 262144/327680 10/01/13 01:41:12 WARN mapred.LocalJobRunner: job_local_0001 org.apache.mahout.matrix.CardinalityException at org.apache.mahout.matrix.AbstractVector.dot(AbstractVector.java:92) at org.apache.mahout.clustering.dirichlet.models.NormalModel.pdf(NormalModel.java:111) at org.apache.mahout.clustering.dirichlet.models.NormalModel.pdf(NormalModel.java:28) at org.apache.mahout.clustering.dirichlet.DirichletState.adjustedProbability(DirichletState.java:129) at org.apache.mahout.clustering.dirichlet.DirichletMapper.normalizedProbabilities(DirichletMapper.java:111) at org.apache.mahout.clustering.dirichlet.DirichletMapper.map(DirichletMapper.java:47) at org.apache.mahout.clustering.dirichlet.DirichletMapper.map(DirichletMapper.java:38) at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50) at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:176) 10/01/13 01:41:12 INFO mapred.JobClient: map 0% reduce 0% 10/01/13 01:41:12 INFO mapred.JobClient: Job complete: job_local_0001 10/01/13 01:41:12 INFO mapred.JobClient: Counters: 0 10/01/13 01:41:12 WARN dirichlet.DirichletDriver: java.io.IOException: Job failed! java.io.IOException: Job failed! at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1252) at org.apache.mahout.clustering.dirichlet.DirichletDriver.runIteration(DirichletDriver.java:214) at org.apache.mahout.clustering.dirichlet.DirichletDriver.runJob(DirichletDriver.java:139) at org.apache.mahout.clustering.dirichlet.DirichletDriver.main(DirichletDriver.java:109) at org.bogdan.clustering.mbeans.Clusters.doClustering(Clusters.java:244) at org.bogdan.clustering.mbeans.Clusters.access$0(Clusters.java:175) at org.bogdan.clustering.mbeans.Clusters$1.run(Clusters.java:148) at java.lang.Thread.run(Thread.java:619) --001485318beb59b2af047d004247--