lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mike Tinnes" <tin...@ecliptictech.com>
Subject Re: SPIDER /CRAWLERS /ROBOTS with lucene
Date Thu, 27 Jun 2002 15:25:59 GMT

I've been using webSphinx with Lucene by simply extending the Crawler class
and placing my Lucene code in the overridden 'visit' method. Seems to work,
but I've encountered problems with 'OutOfMemory' errors when crawling large
sites with 512mb and also using the -Xmx VM args. The sphinx faq mentions
the problem, but the recommened fixes don't seem to help. Alas I've resorted
to implementing a custom crawler.


----- Original Message -----
From: "A Rambocus" <eem2ar@eim.surrey.ac.uk>
To: <lucene-user@jakarta.apache.org>
Sent: Thursday, June 27, 2002 7:55 AM
Subject: SPIDER /CRAWLERS /ROBOTS with lucene


>
> Hello all does anyone know how to integrate th eWebSphinx with lucene...
>  - the code previous distributed on this list does not work!
>
> I am currently trying spindle.......
>
> but does anyone know if lucene could be used to support image indexing
> since this would be very helpful!!
>
> Cheers
>
> Ajay R
>
>
> --
> To unsubscribe, e-mail:
<mailto:lucene-user-unsubscribe@jakarta.apache.org>
> For additional commands, e-mail:
<mailto:lucene-user-help@jakarta.apache.org>
>


--
To unsubscribe, e-mail:   <mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>


Mime
View raw message