I suggest you look at the Articles section of Lucene's site, in
particular an article about XML, Lucene, and Digester. Much better
than using IndexHTML demo, I believe.
Otis
--- Thomas_Krämer <kraemert@smail.uni-koeln.de> wrote:
> Hello Lucene Users
>
> i use Lucene 1.3rc3 to index several thousand metadata records. these
> look as follows:
>
> <?xml version="1.0" encoding="utf-8"?>
> <oaidc:dc xmlns="http://purl.org/dc/elements/1.1/"
> xmlns:oaidc="http://www.openarchives.org/OAI/2.0/oai_dc/"
> xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
> xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/
> http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
>
<identifier>http://infomotions.com/musings/adaptive-technologies/index.shtml</identifier>
>
> <creator>Morgan, Eric Lease</creator> <title>Adaptive
> Technologies</title> <date>1999-04-03</date>
> <description>Computer technology is improving the access to
> information for the disabled but the
> real problem is not necessarily the limitations of technology but
> it's costs. This column points you
> toward sets of Internet resources informing you on how you can make
> your library better prepared for
> the disabled. It also tries to enforce the understanding that
> technologies for the disabled assist
> not only the disabled, but everybody else as well. Libraries of the
> future will understand this fact
> and embrace this change not as a hinderance but as an enhancement of
> service for
> everybody.</description> <subject>CIL (Computers In
> Libraries)</subject> <subject>adaptive
> technologies</subject> <rights>This text is distributed under the
> GNU Public License.</rights>
> <type>text</type> <format>text/html</format>
> <language>en-US</language> <contributor>Infomotions,
> Inc.</contributor> </oaidc:dc>
>
>
> now, how can i find out what fields lucene indexes? Is there any
> possibility to control which tags
> are handled as a field and which not?
>
> I use a slightly modified IndexHTML.java from the 1.3 distribution; i
> added .xml as acceptable files
> extension.
>
>
> Kind regards
>
> thomas
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>
__________________________________
Do you Yahoo!?
New Yahoo! Photos - easier uploading and sharing.
http://photos.yahoo.com/
---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org
|