From: "Mike Tinnes"
To: "Lucene Users List"
Subject: Re: SPIDER /CRAWLERS /ROBOTS with lucene
Date: Thu, 27 Jun 2002 10:25:59 -0500

I've been using WebSphinx with Lucene by simply extending the Crawler class and placing my Lucene code in the overridden 'visit' method. This seems to work, but I've run into OutOfMemory errors when crawling large sites, even with 512 MB and the -Xmx VM args. The WebSphinx FAQ mentions the problem, but the recommended fixes don't seem to help. Alas, I've resorted to implementing a custom crawler.
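
For anyone heading down the same custom-crawler road, here is a minimal sketch of the general idea, not the crawler described above. It uses only the JDK: the `links` map is a stand-in for fetching a page and extracting its links, and the `BoundedCrawler` class, the `maxPages` cap, and the `visit` callback are all hypothetical names. The callback marks where the Lucene indexing code would go. The crawl keeps only URL strings in memory and stops at a fixed page count; whether that addresses the WebSphinx memory problem is an assumption, not something tested against it.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.function.Consumer;

/** Hypothetical sketch: a breadth-first crawl that retains URLs only, never page bodies. */
public class BoundedCrawler {
    // Stand-in for "fetch the page and extract its links" (assumption for this sketch).
    private final Map<String, List<String>> links;
    // Hard cap on the number of pages visited.
    private final int maxPages;

    public BoundedCrawler(Map<String, List<String>> links, int maxPages) {
        this.links = links;
        this.maxPages = maxPages;
    }

    /** Crawls from start, calling visit once per page; returns the URLs in visit order. */
    public List<String> crawl(String start, Consumer<String> visit) {
        Set<String> seen = new HashSet<>();          // URLs already queued or visited
        Deque<String> frontier = new ArrayDeque<>(); // URLs waiting to be visited
        List<String> order = new ArrayList<>();
        seen.add(start);
        frontier.add(start);
        while (!frontier.isEmpty() && order.size() < maxPages) {
            String url = frontier.poll();
            visit.accept(url); // the indexing code would be called from here
            order.add(url);
            for (String next : links.getOrDefault(url, List.of())) {
                if (seen.add(next)) { // add() returns false for URLs seen before
                    frontier.add(next);
                }
            }
        }
        return order;
    }

    public static void main(String[] args) {
        Map<String, List<String>> site = Map.of(
            "/", List.of("/a", "/b"),
            "/a", List.of("/b", "/"),
            "/b", List.of("/c"));
        List<String> order = new BoundedCrawler(site, 3)
            .crawl("/", url -> System.out.println("index " + url));
        System.out.println(order); // [/, /a, /b]
    }
}
```

With the cap set to 3, the crawl stops before reaching "/c", so the frontier never grows past what the cap allows to be visited plus the links discovered along the way.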

----- Original Message -----
From: "A Rambocus"
To:
Sent: Thursday, June 27, 2002 7:55 AM
Subject: SPIDER /CRAWLERS /ROBOTS with lucene

> Hello all,
>
> Does anyone know how to integrate WebSphinx with Lucene? The code
> previously distributed on this list does not work!
>
> I am currently trying Spindle.
>
> Also, does anyone know if Lucene could be used to support image indexing?
> This would be very helpful!
>
> Cheers
>
> Ajay R