Return-Path: Delivered-To: apmail-jakarta-lucene-dev-archive@apache.org Received: (qmail 21819 invoked from network); 4 May 2002 14:32:30 -0000 Received: from unknown (HELO nagoya.betaversion.org) (192.18.49.131) by daedalus.apache.org with SMTP; 4 May 2002 14:32:30 -0000 Received: (qmail 18366 invoked by uid 97); 4 May 2002 14:32:30 -0000 Delivered-To: qmlist-jakarta-archive-lucene-dev@nagoya.betaversion.org Received: (qmail 18279 invoked by alias); 4 May 2002 14:32:30 -0000 Delivered-To: jakarta-archive-lucene-dev@jakarta.apache.org Received: (qmail 18259 invoked by uid 97); 4 May 2002 14:32:29 -0000 Mailing-List: contact lucene-dev-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Developers List" Reply-To: "Lucene Developers List" Delivered-To: mailing list lucene-dev@jakarta.apache.org Received: (qmail 18248 invoked by alias); 4 May 2002 14:32:28 -0000 X-Antivirus: nagoya (v4198 created Apr 24 2002) Date: 4 May 2002 14:32:24 -0000 Message-ID: <20020504143224.12015.qmail@icarus.apache.org> From: otis@apache.org To: jakarta-lucene-sandbox-cvs@apache.org Subject: cvs commit: jakarta-lucene-sandbox/contributions/webcrawler-LARM README.txt X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N otis 02/05/04 07:32:24 Added: contributions/webcrawler-LARM README.txt Log: - A REDME for LARM webcrawler contribution. Revision Changes Path 1.1 jakarta-lucene-sandbox/contributions/webcrawler-LARM/README.txt Index: README.txt =================================================================== $Id: README.txt,v 1.1 2002/05/04 14:32:24 otis Exp $ This is the README file for webcrawler-LARM contribution to Lucene Sandbox. - This contribution requires: a) HTTPClient (not Jakarta's, but this one: http://www.innovation.ch/java/HTTPClient/ b) Jakarta ORO package for regular expressions - The original archive file that I got from Clemens had ORO and HTTPClient in lib directory. I don't think we should include those there, so I took them out. - This contribution also uses 3rd party (X?)HTML parser, which is included. I am not sure if Clemens' modified this parser in any way. If not, maybe we don't have to include it and can instead just add it to the list of required packages. - This code requires(?) JDK 1.4, as it uses assert keyword. $Id: README.txt,v 1.1 2002/05/04 14:32:24 otis Exp $ -- To unsubscribe, e-mail: For additional commands, e-mail: