From: Robin Anil
Date: Mon, 4 Jul 2011 17:59:59 +0530
Subject: Re: 20news
To: user@mahout.apache.org

We are using the default Lucene tokenizer. You can also pass in a tokenizer
via the command line.

On Mon, Jul 4, 2011 at 5:55 PM, Vijay Santhanam wrote:

> No sir.
>
> UTF-8 all the way.
>
> When doing non-sequential training and classification, what class is used
> for tokenization?
>
> I get the feeling different tokenizer classes are used for sequential and
> parallel training/classification.
>
> On Mon, Jul 4, 2011 at 10:23 PM, Robin Anil wrote:
>
> > Are you using some non-standard Java character encoding?
> >
> > On Mon, Jul 4, 2011 at 5:23 PM, Vijay Santhanam wrote:
> >
> > > Hi,
> > >
> > > Okay, I replaced all the tab characters with space characters in each
> > > file in the bayes-test-input folder and now the classifier completes
> > > without error.
> > >
> > > Tomorrow I'll investigate why the trainer parses the tab-separated
> > > label correctly, but the classifier does not. Actually, I know why the
> > > classifier doesn't extract the correct label: because
> > > org.apache.mahout.common.nlp.NGrams tokenizes via spaces only.
> > >
> > > The other mystery is why it works for everyone else except poor me :(
> > >
> > > If anyone has any ideas I'd love to hear them.
> > >
> > > Cheers,
> > > Vijay
> > >
> > > On Mon, Jul 4, 2011 at 9:16 PM, Vijay Santhanam wrote:
> > >
> > > > Hi,
> > > >
> > > > I got the debugger running with Eclipse so I could watch what was
> > > > happening under the hood.
> > > >
> > > > Here's the exception again:
> > > >
> > > > Exception in thread "main" java.lang.IllegalArgumentException: Label not found: alt.atheism from
> > > >     at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> > > >     at org.apache.mahout.classifier.ConfusionMatrix.getCount(ConfusionMatrix.java:93)
> > > >     at org.apache.mahout.classifier.ConfusionMatrix.incrementCount(ConfusionMatrix.java:113)
> > > >     at org.apache.mahout.classifier.ConfusionMatrix.incrementCount(ConfusionMatrix.java:117)
> > > >     at org.apache.mahout.classifier.ConfusionMatrix.addInstance(ConfusionMatrix.java:85)
> > > >     at org.apache.mahout.classifier.ResultAnalyzer.addInstance(ResultAnalyzer.java:67)
> > > >     at org.apache.mahout.classifier.bayes.TestClassifier.classifySequential(TestClassifier.java:244)
> > > >     at org.apache.mahout.classifier.bayes.TestClassifier.main(TestClassifier.java:177)
> > > >     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> > > >     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> > > >     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> > > >     at java.lang.reflect.Method.invoke(Method.java:597)
> > > >     at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
> > > >     at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
> > > >     at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:188)
> > > >
> > > > Notice the "Label not found: alt.atheism\tfrom"
> > > >
> > > > That's an invalid label in the confusion matrix. I think it SHOULD be
> > > > just alt.atheism. I'm not sure how the \tfrom is getting in there, but
> > > > it is.
> > > > Perhaps it has something to do with the way my test data was formatted.
> > > >
> > > > I'll keep digging....
> > > >
> > > > Thanks,
> > > > Vijay
> > > >
> > > > On Mon, Jul 4, 2011 at 8:52 PM, Vijay Santhanam <vijay.santhanam@gmail.com> wrote:
> > > >
> > > >> Hi Robin,
> > > >>
> > > >> The console dump was too large for pastebin, so I uploaded it here:
> > > >> http://dl.dropbox.com/u/7881451/build-20news-bayes-console-output.txt
> > > >>
> > > >> I performed a fresh checkout only hours ago, and I used the script
> > > >> examples/bin/build-20news-bayes.sh
> > > >> I've opted to avoid hadoop, but from what I can tell the model was
> > > >> created successfully.
> > > >>
> > > >> Thanks,
> > > >> Vijay
> > > >>
> > > >> On Mon, Jul 4, 2011 at 8:28 PM, Robin Anil wrote:
> > > >>
> > > >>> Can you send me the console dump:
> > > >>> command line + log written by the program, and put it on, say, pastebin.
> > > >>>
> > > >>> Robin
> > > >>>
> > > >>> On Mon, Jul 4, 2011 at 3:48 PM, Vijay Santhanam wrote:
> > > >>>
> > > >>> > I tried deleting all the folders from the test and train data
> > > >>> > except for alt.atheism, but I get the identical error.
> > > >>> >
> > > >>> > I might try debugging the problem in Eclipse rather than from the
> > > >>> > command line, but Eclipse doesn't quite want to work either.
> > > >>> >
> > > >>> > On Mon, Jul 4, 2011 at 8:02 PM, Vijay Santhanam wrote:
> > > >>> >
> > > >>> > > Thanks anyway Sergey. Could you perhaps upload your bayes-model
> > > >>> > > folder so I could try that out?
> > > >>> > >
> > > >>> > > On Mon, Jul 4, 2011 at 7:57 PM, Sergey Bartunov wrote:
> > > >>> > >
> > > >>> > >> Well, that's strange. Sorry, I can't help you at the moment, maybe
> > > >>> > >> someone else on the mailing list could.
> > > >>> > >>
> > > >>> > >> On 4 July 2011 13:49, Vijay Santhanam <vijay.santhanam@gmail.com> wrote:
> > > >>> > >> > Hi Sergey,
> > > >>> > >> >
> > > >>> > >> > Yes, there were no errors.
> > > >>> > >> >
> > > >>> > >> > And all the model data seems to have been populated into the
> > > >>> > >> > bayes-model folder.
> > > >>> > >> > Also, each main folder in bayes-model has a _SUCCESS file.
> > > >>> > >> >
> > > >>> > >> > See the tarball of my trained model here:
> > > >>> > >> > http://dl.dropbox.com/u/7881451/bayes-model.tar.gz
> > > >>> > >> > Please compare it to your trained model if possible; I would
> > > >>> > >> > like to know if it's different in any way.
> > > >>> > >> >
> > > >>> > >> > Perhaps it's corrupted in some way.
> > > >>> > >> >
> > > >>> > >> > Thanks,
> > > >>> > >> > Vijay
> > > >>> > >> >
> > > >>> > >> > On Mon, Jul 4, 2011 at 7:39 PM, Sergey Bartunov wrote:
> > > >>> > >> >
> > > >>> > >> >> Stop, did you _train_ the classifier successfully before
> > > >>> > >> >> running the _test_?
> > > >>> > >> >>
> > > >>> > >> >> On 4 July 2011 13:30, Vijay Santhanam <vijay.santhanam@gmail.com> wrote:
> > > >>> > >> >> > Hi Sergey,
> > > >>> > >> >> >
> > > >>> > >> >> > I've tried both using the sh script file and following the
> > > >>> > >> >> > instructions at
> > > >>> > >> >> > https://cwiki.apache.org/MAHOUT/twenty-newsgroups.html, like
> > > >>> > >> >> > you suggested. Both return the same results.
> > > >>> > >> >> >
> > > >>> > >> >> > I've uploaded my bayes-test-input folder to Dropbox; the
> > > >>> > >> >> > first file is here...
> > > >>> > >> >> > http://dl.dropbox.com/u/7881451/bayes-test-input/alt.atheism.txt
> > > >>> > >> >> >
> > > >>> > >> >> > Thanks,
> > > >>> > >> >> > Vijay
> > > >>> > >> >> >
> > > >>> > >> >> > On Mon, Jul 4, 2011 at 7:23 PM, Sergey Bartunov <sbos.net@gmail.com> wrote:
> > > >>> > >> >> >
> > > >>> > >> >> >> Paste your bayes-test-input file somewhere.
> > > >>> > >> >> >>
> > > >>> > >> >> >> On 4 July 2011 13:20, Sergey Bartunov wrote:
> > > >>> > >> >> >> > Yes, I worked WITH hadoop, but there should be no
> > > >>> > >> >> >> > difference.
> > > >>> > >> >> >> >
> > > >>> > >> >> >> > Why do you use examples/bin/build-20news-bayes.sh instead
> > > >>> > >> >> >> > of running bin/mahout directly? Is it the same?
> > > >>> > >> >> >> >
> > > >>> > >> >> >> > On 4 July 2011 13:12, Vijay Santhanam <vijay.santhanam@gmail.com> wrote:
> > > >>> > >> >> >> >> Thanks Sergey,
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >> I'm still receiving the same error after following those
> > > >>> > >> >> >> >> steps. I've chosen not to use hadoop. Does yours work
> > > >>> > >> >> >> >> WITH hadoop?
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >> A few bits of info that might be relevant.
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >> My examples/bin/work folder contains the expected folders
> > > >>> > >> >> >> >> from test data preparation and training...
> > > >>> > >> >> >> >> drwxr-xr-x@ 22 Vijay staff 748 18 Mar 2003 20news-bydate-test
> > > >>> > >> >> >> >> drwxr-xr-x@ 22 Vijay staff 748 18 Mar 2003 20news-bydate-train
> > > >>> > >> >> >> >> drwxr-xr-x   3 Vijay staff 102  4 Jul 19:03 bayes-model
> > > >>> > >> >> >> >> drwxr-xr-x  22 Vijay staff 748  4 Jul 18:20 bayes-test-input
> > > >>> > >> >> >> >> drwxr-xr-x  22 Vijay staff 748  4 Jul 17:49 bayes-train-input
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >> I appreciate your help. Do you have any other suggestions?
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >> Regards,
> > > >>> > >> >> >> >> Vijay
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >> On Mon, Jul 4, 2011 at 6:58 PM, Sergey Bartunov <sbos.net@gmail.com> wrote:
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >>> When I started with Mahout I had the same errors. In my
> > > >>> > >> >> >> >>> case, I just didn't run PrepareTwentyNewsgroups. You
> > > >>> > >> >> >> >>> could try carefully repeating all the steps from
> > > >>> > >> >> >> >>> https://cwiki.apache.org/MAHOUT/twenty-newsgroups.html
> > > >>> > >> >> >> >>>
> > > >>> > >> >> >> >>> On 4 July 2011 12:52, Vijay Santhanam <vijay.santhanam@gmail.com> wrote:
> > > >>> > >> >> >> >>> > Hi All,
> > > >>> > >> >> >> >>> >
> > > >>> > >> >> >> >>> > I'm new to Mahout and I'm interested in experimenting
> > > >>> > >> >> >> >>> > with its classifiers.
> > > >>> > >> >> >> >>> >
> > > >>> > >> >> >> >>> > Right now, I'm just trying to get up and running with
> > > >>> > >> >> >> >>> > the demos and examples.
> > > >>> > >> >> >> >>> >
> > > >>> > >> >> >> >>> > After checking out the mahout trunk, I've tried running
> > > >>> > >> >> >> >>> > the classification example 20news, but after running the
> > > >>> > >> >> >> >>> > ./examples/bin/build-20news-bayes.sh script I get the
> > > >>> > >> >> >> >>> > following error during the classification phase.
> > > >>> > >> >> >> >>> >
> > > >>> > >> >> >> >>> > Does anyone else get the same thing? Or have any
> > > >>> > >> >> >> >>> > recommendations about how to fix it?
> > > >>> > >> >> >> >>> > I'd just like to get a sample classifier working before
> > > >>> > >> >> >> >>> > I embark on my own classification journey.
> > > >>> > >> >> >> >>> >
> > > >>> > >> >> >> >>> > INFO: Loading model from: {basePath=examples/bin/work/20news-bydate/bayes-model, classifierType=bayes, alpha_i=1.0, dataSource=hdfs, gramSize=1, verbose=false, encoding=UTF-8, defaultCat=unknown, testDirPath=examples/bin/work/20news-bydate/bayes-test-input}
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:25 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: Testing Bayes Classifier
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:27 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: Read 50000 feature weights
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:27 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: Read 100000 feature weights
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:28 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: 193370.88331085522
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: rec.sport.baseball -129829.34738930278 531784.7805631821 -0.2441388925268003
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: sci.crypt -193023.42370049533 531784.7805631821 -0.3629728242618669
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: rec.sport.hockey -167853.6159738822 531784.7805631821 -0.31564200802459647
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: talk.politics.guns -203524.0148974065 531784.7805631821 -0.3827187658170024
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: soc.religion.christian -163900.9258713857 531784.7805631821 -0.308209132457322
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: sci.electronics -142854.1677345925 531784.7805631821 -0.26863154598614886
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: comp.os.ms-windows.misc -531784.7805631821 531784.7805631821 -1.0
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: misc.forsale -143454.70176448982 531784.7805631821 -0.26976082619845826
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: talk.religion.misc -139428.73484148504 531784.7805631821 -0.2621901565024562
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: alt.atheism -139569.06867597546 531784.7805631821 -0.2624540486626301
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: comp.windows.x -178029.10523376046 531784.7805631821 -0.33477660839638973
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: talk.politics.mideast -193075.00789450994 531784.7805631821 -0.36306982627452317
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: comp.sys.ibm.pc.hardware -138410.02049984262 531784.7805631821 -0.2602745049477736
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: comp.sys.mac.hardware -125200.9927438868 531784.7805631821 -0.23543545682389364
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: sci.space -192437.0009266271 531784.7805631821 -0.3618700797018455
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: rec.motorcycles -143142.20855440624 531784.7805631821 -0.26917319522159455
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: rec.autos -141800.97549909537 531784.7805631821 -0.2666510601317365
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: comp.graphics -166882.18654471825 531784.7805631821 -0.3138152738556811
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: talk.politics.misc -165196.84193278523 531784.7805631821 -0.3106460507535303
> > > >>> > >> >> >> >>> > Jul 4, 2011 6:28:30 PM org.slf4j.impl.JCLLoggerAdapter info
> > > >>> > >> >> >> >>> > INFO: sci.med -192698.5183245711 531784.7805631821 -0.36236185270382393
> > > >>> > >> >> >> >>> > Exception in thread "main" java.lang.IllegalArgumentException: Label not found: alt.atheism from
> > > >>> > >> >> >> >>> >     at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> > > >>> > >> >> >> >>> >     at org.apache.mahout.classifier.ConfusionMatrix.getCount(ConfusionMatrix.java:93)
> > > >>> > >> >> >> >>> >     at org.apache.mahout.classifier.ConfusionMatrix.incrementCount(ConfusionMatrix.java:113)
> > > >>> > >> >> >> >>> >     at org.apache.mahout.classifier.ConfusionMatrix.incrementCount(ConfusionMatrix.java:117)
> > > >>> > >> >> >> >>> >     at org.apache.mahout.classifier.ConfusionMatrix.addInstance(ConfusionMatrix.java:85)
> > > >>> > >> >> >> >>> >     at org.apache.mahout.classifier.ResultAnalyzer.addInstance(ResultAnalyzer.java:67)
> > > >>> > >> >> >> >>> >     at org.apache.mahout.classifier.bayes.TestClassifier.classifySequential(TestClassifier.java:244)
> > > >>> > >> >> >> >>> >     at org.apache.mahout.classifier.bayes.TestClassifier.main(TestClassifier.java:177)
> > > >>> > >> >> >> >>> >     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> > > >>> > >> >> >> >>> >     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> > > >>> > >> >> >> >>> >     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> > > >>> > >> >> >> >>> >     at java.lang.reflect.Method.invoke(Method.java:597)
> > > >>> > >> >> >> >>> >     at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
> > > >>> > >> >> >> >>> >     at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
> > > >>> > >> >> >> >>> >     at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:188)
> > > >>> > >> >> >> >>> >
> > > >>> > >> >> >> >>> > Any help is greatly appreciated.
> > > >>> > >> >> >> >>> >
> > > >>> > >> >> >> >>> > Regards,
> > > >>> > >> >> >> >>> > --
> > > >>> > >> >> >> >>> > Vijay Santhanam
> > > >>> > >> >> >> >>> > Software Engineer
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >> --
> > > >>> > >> >> >> >> Vijay Santhanam
> > > >>> > >> >> >> >> Software Engineer
> > > >>> > >> >> >> >> http://au.linkedin.com/in/vijaysanthanam
> > > >>> > >> >> >> >> 0407525087
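For anyone following along, here is a minimal sketch of the tab-versus-space problem discussed in this thread, assuming each line of a bayes-test-input file has the form label<TAB>text. It does not call Mahout at all; the class name, the three helper methods, and the sample line are hypothetical and only illustrate why a space-only split could produce the label "alt.atheism\tfrom", and why replacing tabs with spaces made the error go away.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

// Illustration only: not Mahout code. All names here are hypothetical.
public class LabelSplitDemo {

    // Treats everything before the first space as the label.
    // A tab is not a space, so "alt.atheism\tfrom ..." yields "alt.atheism\tfrom".
    public static String labelBySpace(String line) {
        int firstSpace = line.indexOf(' ');
        return firstSpace < 0 ? line : line.substring(0, firstSpace);
    }

    // Treats everything before the first run of whitespace (tab or space) as the label.
    public static String labelByWhitespace(String line) {
        return line.split("\\s+", 2)[0];
    }

    // The workaround reported in the thread: turn tabs into spaces before parsing.
    public static String tabsToSpaces(String line) {
        return line.replace('\t', ' ');
    }

    public static void main(String[] args) {
        // Assumed set of labels known to the trained model.
        Set<String> knownLabels = new HashSet<>(Arrays.asList("alt.atheism", "sci.space"));

        // Assumed shape of one test-input line: label, tab, then the document text.
        String line = "alt.atheism\tfrom mathew subject alt atheism faq";

        String spaceOnly = labelBySpace(line);
        String anyWhitespace = labelByWhitespace(line);
        String afterWorkaround = labelBySpace(tabsToSpaces(line));

        System.out.println("space-only split is a known label:   " + knownLabels.contains(spaceOnly));
        System.out.println("whitespace split is a known label:   " + knownLabels.contains(anyWhitespace));
        System.out.println("tabs replaced, then space-only split: " + knownLabels.contains(afterWorkaround));
    }
}
```

Whether the real classifier path splits exactly like labelBySpace is an assumption based on the observation above that NGrams tokenizes via spaces only; the sketch is meant for reasoning about the symptom, not as a description of the Mahout source.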