Return-Path: Delivered-To: apmail-lucene-nutch-dev-archive@www.apache.org Received: (qmail 1798 invoked from network); 9 Oct 2007 05:10:50 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 9 Oct 2007 05:10:50 -0000 Received: (qmail 27655 invoked by uid 500); 9 Oct 2007 05:10:37 -0000 Delivered-To: apmail-lucene-nutch-dev-archive@lucene.apache.org Received: (qmail 27621 invoked by uid 500); 9 Oct 2007 05:10:37 -0000 Mailing-List: contact nutch-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: nutch-dev@lucene.apache.org Delivered-To: mailing list nutch-dev@lucene.apache.org Received: (qmail 27608 invoked by uid 99); 9 Oct 2007 05:10:37 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 08 Oct 2007 22:10:37 -0700 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of sagar020785@gmail.com designates 209.85.198.187 as permitted sender) Received: from [209.85.198.187] (HELO rv-out-0910.google.com) (209.85.198.187) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 09 Oct 2007 05:10:39 +0000 Received: by rv-out-0910.google.com with SMTP id k20so847240rvb for ; Mon, 08 Oct 2007 22:10:12 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; bh=X4Hv0gTO/Kn6HoauARV/jAuztkwJcw5PuiqfoYpB1k8=; b=Gy3ZXsb0Ap9j1Z8JQ2byuMLETutt6yyUdQgOWhvbeqxCrLb4kkrEoNgMsEsuRNvQP8NhTx0osSYw5skniWzFUdO6Hd+9p8sx0bM203ou+N6UKXAtzDf0P/8+QXg4N28qcweRNmVthp5sIppEtEw6erxioO+jMHIeoNTiw2wlyv0= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=q83BHbrNInEARynTWMLdVrJI28p3MWIAeI1OBc8/72VAM7x6bLNBWqNQdMVr57qgq2iW53TtftAc0i0aU/9rLI57tnWRtZ77WQQE2PtOgeaPnhw223K6TDJyD2uBO/qasvhqr93W47YgOVGTYs0c+CZrURQWE+GW/sBMO8B6Uro= Received: by 10.141.87.13 with SMTP id p13mr3302775rvl.1191906612084; Mon, 08 Oct 2007 22:10:12 -0700 (PDT) Received: by 10.141.27.17 with HTTP; Mon, 8 Oct 2007 22:10:12 -0700 (PDT) Message-ID: <3eddcc810710082210w6b53bfcdm6eadee7203600c6@mail.gmail.com> Date: Tue, 9 Oct 2007 10:40:12 +0530 From: "Sagar Vibhute" To: nutch-dev@lucene.apache.org Subject: Re: Java Packages (missing) In-Reply-To: <4709BF83.9090909@apache.org> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_24304_11116335.1191906612080" References: <3eddcc810710070501j628e52f6kea729ec1a0b937e2@mail.gmail.com> <4709BF83.9090909@apache.org> X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_24304_11116335.1191906612080 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline Hi, Thanks Dennis. But then why do I get a ClassNotFoundException when I run a nutch crawl? The hadoop.log says: ------------------------------------------------------------------------------------------------------- 2007-10-09 10:34:21,605 WARN net.URLNormalizers - URLNormalizers:PluginRuntimeException when initializing url normalizer plugin urlnormalizer-basic instance in getURLNormalizers function: attempting to continue instantiating plugins 2007-10-09 10:34:21,608 WARN net.URLNormalizers - URLNormalizers:PluginRuntimeException when initializing url normalizer plugin urlnormalizer-regex instance in getURLNormalizers function: attempting to continue instantiating plugins 2007-10-09 10:34:21,615 WARN net.URLNormalizers - URLNormalizers:PluginRuntimeException when initializing url normalizer plugin urlnormalizer-pass instance in getURLNormalizers function: attempting to continue instantiating plugins 2007-10-09 10:34:21,658 WARN mapred.LocalJobRunner - job_y7yavp java.lang.RuntimeException: org.apache.nutch.plugin.PluginRuntimeException: java.lang.ClassNotFoundException: org.apache.nutch.urlfilter.regex.RegexURLFilter at org.apache.nutch.net.URLFilters.(URLFilters.java:74) at org.apache.nutch.crawl.Injector$InjectMapper.configure(Injector.java :60) at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java :58) at org.apache.hadoop.util.ReflectionUtils.newInstance( ReflectionUtils.java:82) at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:34) at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java :58) at org.apache.hadoop.util.ReflectionUtils.newInstance( ReflectionUtils.java:82) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:170) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java :126) Caused by: org.apache.nutch.plugin.PluginRuntimeException: java.lang.ClassNotFoundException: org.apache.nutch.urlfilter.regex.RegexURLFilter at org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java :166) at org.apache.nutch.net.URLFilters.(URLFilters.java:54) ... 8 more Caused by: java.lang.ClassNotFoundException: org.apache.nutch.urlfilter.regex.RegexURLFilter at java.net.URLClassLoader$1.run(URLClassLoader.java:200) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:188) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at java.lang.ClassLoader.loadClass(ClassLoader.java:251) at org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java :156) ... 9 more ------------------------------------------------------------------------------------------------------- - Sagar On 10/8/07, Dennis Kubes wrote: > > These are classes from plugins and therefore are in their specific > plugin src directory. For example regex url normalized is found at: > > > NutchTrunk\src\plugin\urlnormalizer-regex\src\java\org\apache\nutch\net\urlnormalizer\regex\RegexURLNormalizer.java > > Dennis Kubes > > Sagar Vibhute wrote: > > Hello, > > > > Does the default provided nutch0.9 package comes with certain java > packages > > missing? > > > > I could compile the source (I downloaded the tarball, not from svn) > using > > ant. But when I start crawling it throws ClassNotFoundException, like: > > java.lang.ClassNotFoundException: > > org.apache.nutch.net.urlnormalizer.basic.BasicURLNormalizer > > java.lang.ClassNotFoundException: > > org.apache.nutch.net.urlnormalizer.regex.RegexURLNormalizer > > java.lang.ClassNotFoundException: > > org.apache.nutch.net.urlnormalizer.pass.PassURLNormalizer > > and others. > > > > Can I get the additional java packages anywhere? (If they are missing). > > > > ThankYou > > > > - Sagar > > > ------=_Part_24304_11116335.1191906612080--