nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From 皮皮 <pi.bingf...@gmail.com>
Subject Re: Help me, No urls to fetch.
Date Fri, 04 Sep 2009 04:39:06 GMT
check the time clocks of namenode and datanode  is synchronized.

2009/9/3 MilleBii <millebii@gmail.com>

> Is there more information in logs/hadoop file ?
>
> What is your plug-in list ?
>
> 2009/9/2 zo tiger <zo.tiger@hotmail.com>
>
> >
> > Thank you for your reply.
> >
> > In urls directory(exactly /nutch/search/urls) , there is a file
> > urllist.txt.
> >
> > content is as following.
> >
> >      http://lucene.apache.org
> >
> > I don't understand why nutch can not fetch any url.
> >
> >
> > Paul Tomblin wrote:
> > >
> > > On Wed, Sep 2, 2009 at 6:36 AM, zo tiger<zo.tiger@hotmail.com> wrote:
> > >>
> > >
> > >> At last i ran bin/nutch crawl command but it gives
> > >>
> > >> No urls to fetch check your filter and seed list error
> > >>
> > >> I am sure there is no problem in crawl-url filter and other
> > configuration
> > >> xml files
> > >>
> > >> İs anyone know any possible problem????
> > >>
> > >
> > > What's in your url directory?
> > >
> > >
> > > --
> > > http://www.linkedin.com/in/paultomblin
> > >
> > >
> >
> > --
> > View this message in context:
> >
> http://www.nabble.com/Help-me%2C-No-urls-to-fetch.-tp25255142p25255944.html
> > Sent from the Nutch - User mailing list archive at Nabble.com.
> >
> >
>
>
> --
> -MilleBii-
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message