nutch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Markus Jelsma <mar...@apache.org>
Subject Re: how are CSV/TXT files handled
Date Tue, 07 Feb 2012 09:17:48 GMT
Upgrade to 1.4.

> With the "nutch parsechecker" command I get the following error message:
> 
> "Error: Could not find or load main class parsechecker", this doesn't sound
> good!
> 
> On Tue, Feb 7, 2012 at 9:58 AM, remi tassing <tassingremi@gmail.com> wrote:
> > The point that made me start thinking is because I got this error
> > message:
> > 
> > "failed(2,0): Can't retrieve Tika parser for mime-type
> > application/ms-excel"
> > 
> > I'm using Nutch-1.2 and my nutch-site.xml has:
> > 
> > "<property>
> > 
> >   <name>plugin.includes</name>
> > 
> > <value>protocol-httpclient|urlfilter-regex|parse-(text|html|js|tika)|inde
> > x-(basic|anchor)|q..."
> > 
> > Remi
> > 
> > On Tue, Feb 7, 2012 at 9:16 AM, remi tassing <tassingremi@gmail.com>wrote:
> >> Hey guys,
> >> 
> >> I checked the mailing-list archive but couldn't get an answer on this. I
> >> think CSV and TXT don't need any kind of parsing, but how.are handled by
> >> default?
> >> 
> >> Remi

Mime
View raw message