lucene-java-user mailing list archives

From Thomas Krämer <kraem...@smail.uni-koeln.de>
Subject Types of field in index
Date Mon, 22 Dec 2003 12:38:59 GMT
Hello Lucene Users

I use Lucene 1.3rc3 to index several thousand metadata records. They look like this:

<?xml version="1.0" encoding="utf-8"?>
<oaidc:dc xmlns="http://purl.org/dc/elements/1.1/"
          xmlns:oaidc="http://www.openarchives.org/OAI/2.0/oai_dc/"
          xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
          xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <identifier>http://infomotions.com/musings/adaptive-technologies/index.shtml</identifier>
  <creator>Morgan, Eric Lease</creator>
  <title>Adaptive Technologies</title>
  <date>1999-04-03</date>
  <description>Computer technology is improving the access to information for the disabled but the real problem is not necessarily the limitations of technology but it's costs. This column points you toward sets of Internet resources informing you on how you can make your library better prepared for the disabled. It also tries to enforce the understanding that technologies for the disabled assist not only the disabled, but everybody else as well. Libraries of the future will understand this fact and embrace this change not as a hinderance but as an enhancement of service for everybody.</description>
  <subject>CIL (Computers In Libraries)</subject>
  <subject>adaptive technologies</subject>
  <rights>This text is distributed under the GNU Public License.</rights>
  <type>text</type>
  <format>text/html</format>
  <language>en-US</language>
  <contributor>Infomotions, Inc.</contributor>
</oaidc:dc>


Now, how can I find out which fields Lucene indexes? Is there any way to control which tags
are handled as fields and which are not?

I use a slightly modified IndexHTML.java from the 1.3 distribution; I added .xml as an
accepted file extension.
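
One possible way to control which tags end up as fields, given only as a rough sketch: parse each record with the JDK's XML parser and keep only the tags you list, then build the Lucene document from that map yourself. The class name DcFieldExtractor, the method extract and the set "wanted" are made up for illustration and are not part of Lucene or of the demo code. The Lucene step appears only as a comment and assumes the Field.Text / Field.Keyword factory methods of the 1.x API.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

// Hypothetical helper: picks selected Dublin Core tags out of one record.
public class DcFieldExtractor {

    /** Returns the text of every wanted child of the record root, keyed by local tag name. */
    static Map<String, List<String>> extract(String xml, Set<String> wanted) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true); // needed so getLocalName() returns the bare tag name
        Element root = factory.newDocumentBuilder()
                .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)))
                .getDocumentElement();

        Map<String, List<String>> fields = new LinkedHashMap<>();
        NodeList children = root.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            Node node = children.item(i);
            if (node.getNodeType() != Node.ELEMENT_NODE) {
                continue; // skip whitespace text nodes between tags
            }
            String name = node.getLocalName();
            if (!wanted.contains(name)) {
                continue; // this tag is deliberately not turned into a field
            }
            // Collapse line breaks and runs of spaces left over from wrapping.
            String value = node.getTextContent().trim().replaceAll("\\s+", " ");
            fields.computeIfAbsent(name, k -> new ArrayList<>()).add(value);
        }
        return fields;
    }

    public static void main(String[] args) throws Exception {
        String xml = "<oaidc:dc xmlns=\"http://purl.org/dc/elements/1.1/\""
                + " xmlns:oaidc=\"http://www.openarchives.org/OAI/2.0/oai_dc/\">"
                + "<creator>Morgan, Eric Lease</creator>"
                + "<title>Adaptive Technologies</title>"
                + "<subject>CIL (Computers In Libraries)</subject>"
                + "<subject>adaptive technologies</subject>"
                + "<rights>This text is distributed under the GNU Public License.</rights>"
                + "</oaidc:dc>";
        Set<String> wanted = new LinkedHashSet<>(Arrays.asList("title", "creator", "subject"));

        for (Map.Entry<String, List<String>> entry : extract(xml, wanted).entrySet()) {
            for (String value : entry.getValue()) {
                // Assumed Lucene 1.x step, not executed here:
                //   doc.add(Field.Text(entry.getKey(), value));
                System.out.println(entry.getKey() + " = " + value);
            }
        }
    }
}
```

Repeated tags such as subject are kept as separate values, so each one could be added as its own field instance under the same name. Tags left out of "wanted" (rights in the sample above) never reach the index at all.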


Kind regards

thomas



