Mailing-List: contact forrest-dev-help@xml.apache.org; run by ezmlm
Precedence: bulk
Reply-To: forrest-dev@xml.apache.org
From: "Ramon Prades" <rprades@porcelanosa.com>
To: <forrest-dev@xml.apache.org>
Subject: RE: Lucene  Search
Date: Thu, 7 Aug 2003 17:45:33 +0200
Message-ID: <001901c35cfa$f310f4e0$0100a8c0@pcramon>
MIME-Version: 1.0
Content-Type: text/plain;
	charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
In-Reply-To: <3F326D2A.9080405@che-che.com>
Importance: Normal

Hi Cheche

I've already have an indexer for document v1.2, although it needs =
tidying up
and optimizing. It uses SAX as the example you've sent.=20

The idea is parsing all forrest xml files in the site and index them
according to this rules:

- Store title, abstract and authors.
- Index the full content of the document in plain text (I mean, no tags =
at
all).=20

That's going to be done at build time.

Then, in the html generation add another button next to the Google =
search
box to search in the site. This button will pass the request to a =
servlet,
and this servlet will generate an xml list with all the matching =
documents,
including their title and their abstract. To render the xml we can use =
xsl.
The problem is that all this stuff won't work in static sites, but will =
in
the webapp.

This is more or less the idea, but I would like to know your thoughts.

Thanks.

Ramon
-----Mensaje original-----
De: Juan Jose Pablos [mailto:cheche@che-che.com]=20
Enviado el: jueves, 07 de agosto de 2003 17:16
Para: forrest-dev@xml.apache.org
Asunto: Re: Lucene Search


Ram=F3n,

Unfortunaly there is not much work on this, but I am willing to help you =

out.

For creating the index check here:
http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/XM=
L-I
ndexing-Demo/
That needs to be extended to support document v12.

Reusing the google seach box is fine but there must be a xsp or similar=20
running on forrest that gets the query search.

Looking forward.

Cheers,
Cheche

Ram=F3n Prades wrote:
> =20
> I need to add searching capabilities to my forrest site. I think the
> search-in-site feature could be done in Lucene. The files could be=20
> indexed at build time, and a search box could be added to all =
generated=20
> pages (or even better, the existing Google search box could be reused=20
> maybe adding a second button "Search in Site").
> =20
> Is there any work on that or shall I do it myself?
> =20
> Regards.
> =20
> Ramon Prades