Return-Path: Delivered-To: apmail-xml-forrest-dev-archive@xml.apache.org Received: (qmail 21358 invoked by uid 500); 7 Aug 2003 15:45:16 -0000 Mailing-List: contact forrest-dev-help@xml.apache.org; run by ezmlm Precedence: bulk list-help: list-unsubscribe: list-post: Reply-To: forrest-dev@xml.apache.org Delivered-To: mailing list forrest-dev@xml.apache.org Received: (qmail 21328 invoked from network); 7 Aug 2003 15:45:15 -0000 Received: from rosone.porcelanosa.com (212.101.68.110) by daedalus.apache.org with SMTP; 7 Aug 2003 15:45:15 -0000 Received: (qmail 4438 invoked by uid 95); 7 Aug 2003 15:41:13 -0000 Received: from rprades@porcelanosa.com by rosone by uid 92 with qmail-scanner-1.10 (uvscan: v4.1.60/v4217. . Clear:0. Processed in 0.257815 secs); 07 ago 2003 15:41:13 -0000 Received: from unknown (HELO pcramon) (128.1.1.12) by rosone.porcelanosa.com with SMTP; 7 Aug 2003 15:41:12 -0000 From: "Ramon Prades" To: Subject: RE: Lucene Search Date: Thu, 7 Aug 2003 17:45:33 +0200 Message-ID: <001901c35cfa$f310f4e0$0100a8c0@pcramon> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable X-Priority: 3 (Normal) X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook, Build 10.0.4510 In-Reply-To: <3F326D2A.9080405@che-che.com> Importance: Normal X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1106 X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N Hi Cheche I've already have an indexer for document v1.2, although it needs = tidying up and optimizing. It uses SAX as the example you've sent.=20 The idea is parsing all forrest xml files in the site and index them according to this rules: - Store title, abstract and authors. - Index the full content of the document in plain text (I mean, no tags = at all).=20 That's going to be done at build time. Then, in the html generation add another button next to the Google = search box to search in the site. This button will pass the request to a = servlet, and this servlet will generate an xml list with all the matching = documents, including their title and their abstract. To render the xml we can use = xsl. The problem is that all this stuff won't work in static sites, but will = in the webapp. This is more or less the idea, but I would like to know your thoughts. Thanks. Ramon -----Mensaje original----- De: Juan Jose Pablos [mailto:cheche@che-che.com]=20 Enviado el: jueves, 07 de agosto de 2003 17:16 Para: forrest-dev@xml.apache.org Asunto: Re: Lucene Search Ram=F3n, Unfortunaly there is not much work on this, but I am willing to help you = out. For creating the index check here: http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/XM= L-I ndexing-Demo/ That needs to be extended to support document v12. Reusing the google seach box is fine but there must be a xsp or similar=20 running on forrest that gets the query search. Looking forward. Cheers, Cheche Ram=F3n Prades wrote: > =20 > I need to add searching capabilities to my forrest site. I think the > search-in-site feature could be done in Lucene. The files could be=20 > indexed at build time, and a search box could be added to all = generated=20 > pages (or even better, the existing Google search box could be reused=20 > maybe adding a second button "Search in Site"). > =20 > Is there any work on that or shall I do it myself? > =20 > Regards. > =20 > Ramon Prades