lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Tremont romain <romain.trem...@xml-ais.com>
Subject Lucene files
Date Mon, 28 Oct 2002 12:01:03 GMT
Hi folk,

	I was looking in the structure of the index files and here is what 
I have found... I'd like to know if what I have seen is right and if I
didn't forget something:

*.fdt:
	- Start with the number of record
	- for each record
		- record number
		- if it's tokenized
		- record value (which field stored is true, this is for short 
		  field)

*.fnm:
	-contain fields information:
	for each fields:
		- field name
		- if field is indexed
segment:
	- contain the base name of the segment file

*.prx:
	- contain proximity offset for each indexed word

*.freq: 
if the document is not optimize
	- contain document number and frequence of the word in the document
if the document is optimized 
	- contain the frequence of the word in the document 

.f([1-9])+:

	I don't know (here I need your help)

.tii or .tis:
	I m really not sure about this

		- contain word
		- proximity
		- frequence


Thanks in advance :)
		
-- 

Trémont romain 
EPITA promotion 2004
Stagiaire chez AIS

--
To unsubscribe, e-mail:   <mailto:lucene-dev-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-dev-help@jakarta.apache.org>


Mime
View raw message