The active segments in the index are stored in the
segment info file,
- segments_N
- . There may
+ segments_N.
+ There may
be one or more
segments_N
files in the
@@ -1222,13 +1222,13 @@
As of 2.1, there is also a file
- segments.gen
- . This file contains the
+ segments.gen.
+ This file contains the
current generation (the
_N
in
- segments_N
- ) of the index. This is
+ segments_N)
+ of the index. This is
used only as a fallback in case the current
generation cannot be accurately determined by
directory listing alone (as is the case for some
@@ -1248,12 +1248,11 @@
2.1 and above:
- Segments --> Format, Version, NameCounter, SegCount, <SegName, SegSize, DelGen, NumField, NormGen
- NumField
- >
- SegCount
- , IsCompoundFile
-
Format, NameCounter, SegCount, SegSize, NumField --> Int32
@@ -1264,10 +1263,10 @@
SegName --> String
- IsCompoundFile --> Int8
+ IsCompoundFile, HasSingleNormFile --> Int8
- Format is -1 as of Lucene 1.4 and -2 as of Lucene 2.1.
+ Format is -1 as of Lucene 1.4 and -3 (SegmentInfos.FORMAT_SINGLE_NORM_FILE) as of Lucene 2.1.
Version counts how often the index has been
@@ -1312,7 +1311,14 @@
which means we check filesystem to see if _X.cfs exists.
-
+
+ If HasSingleNormFile is 1, then the field norms are
+ written as a single joined file (with extension
+ .nrm); if it is 0 then each field's norms
+ are stored as separate .fN files. See
+ "Normalization Factors" below for details.
+
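As a rough illustration of the HasSingleNormFile flag described in the hunk above (this is not part of the commit), the sketch below lists the norm file names a reader might expect for one segment. The function name `norm_file_names` and its parameters are hypothetical, and it assumes for simplicity that field numbers run from 0 to NumField-1 and that every field has norms.

```python
def norm_file_names(segment_name, num_fields, has_single_norm_file):
    """Return the norm file names expected for one segment.

    Hypothetical helper for illustration only; assumes fields are
    numbered 0..num_fields-1 and that each field stores norms.
    """
    if has_single_norm_file == 1:
        # All field norms are joined into a single .nrm file.
        return [f"{segment_name}.nrm"]
    # Otherwise each field's norms live in a separate .fN file.
    return [f"{segment_name}.f{n}" for n in range(num_fields)]


if __name__ == "__main__":
    print(norm_file_names("_3", 2, 1))
    print(norm_file_names("_3", 2, 0))
```

The flag is passed in as an integer, mirroring the Int8 value stored in the segments file, rather than as a boolean.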
+
A write lock is used to indicate that another
@@ -1334,7 +1340,7 @@
Note that prior to version 2.1, Lucene also used a commit lock. This was removed in 2.1.
-
+
Prior to Lucene 2.1 there was a file "deletable"
@@ -1343,7 +1349,7 @@
the files that are deletable, instead, so no file is written.
-
+
Starting with Lucene 1.4 the compound file format became default. This is simply a container for all files described in the next section
@@ -1363,14 +1369,14 @@
-
+
The remaining files are all per-segment, and are thus defined by suffix.
-
+
The term dictionary is represented as two files:
@@ -1764,7 +1770,7 @@
-
+
The .frq file contains the lists of documents
@@ -1854,7 +1860,7 @@
nd starts.
-
+
The .prx file contains the lists of positions that
@@ -1899,7 +1905,7 @@
4, 5, 4
-
+
@@ -2003,7 +2009,7 @@
2.1 and above: Separate norm files are created (when adequate) for both compound and non compound segments.
-
+
The .del file is optional, and only exists when a segment contains deletions.
@@ -2201,7 +2207,7 @@
There
Modified: lucene/java/trunk/docs/fileformats.pdf
URL: http://svn.apache.org/viewvc/lucene/java/trunk/docs/fileformats.pdf?view=diff&rev=499208&r1=499207&r2=499208
==============================================================================
--- lucene/java/trunk/docs/fileformats.pdf (original)
+++ lucene/java/trunk/docs/fileformats.pdf Tue Jan 23 17:28:53 2007