nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "M.Rizwan" <muhammad.riz...@sigmatec.com.pk>
Subject Re: Exception org.apache.hadoop.mapred.InvalidInputException: Input path does not exist: file:/home/nutch/1.4/runtime/local/crawl/segments/20111209174842/parse_data
Date Sat, 10 Dec 2011 13:54:41 GMT
Thanks Rami. Yes not a good solution but this worked for me too.

Thanks for sharing.

On Fri, Dec 9, 2011 at 5:13 PM, remi tassing <tassingremi@gmail.com> wrote:

> Sorry, I forgot to change the title...
>
> However I had the same error "Exception
> org.apache.hadoop.mapred.InvalidInputException: Input path does not exist:
> file:/home/nutch/1.4/runtime/local/crawl/segments/..." this morning.
>
> I believe it's because I stopped Nutch while it was crawling and data were
> not saved properly.
>
> I couldn't find an alternative and just had to delete my "crawl" folder,
> then it worked...Not a good solution!
>
> On Fri, Dec 9, 2011 at 2:08 PM, Lewis John Mcgibbney <
> lewis.mcgibbney@gmail.com> wrote:
>
> > Hi Remi,
> >
> > Please don't hijack someone's thread, start your own.
> >
> > Thank you
> >
> > Lewis
> >
> > On Fri, Dec 9, 2011 at 8:26 AM, remi tassing <tassingremi@gmail.com>
> > wrote:
> >
> > > Hello guys,
> > >
> > > how do you use "org.apache.nutch.net.URLFilterChecker"? It's not
> > documented
> > > and it always shows me this "Checking combination of all URLFilters
> > > available" and then gets stuck.
> > >
> > > Remi
> > >
> >
> >
> >
> > --
> > *Lewis*
> >
>
>
>
> --
> Remi Tassing
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message