lucene-java-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From mikemcc...@apache.org
Subject svn commit: r761886 - in /lucene/java/trunk: docs/fileformats.html docs/fileformats.pdf src/site/src/documentation/content/xdocs/fileformats.xml
Date Sat, 04 Apr 2009 10:18:05 GMT
Author: mikemccand
Date: Sat Apr  4 10:18:05 2009
New Revision: 761886

URL: http://svn.apache.org/viewvc?rev=761886&view=rev
Log:
clarify limitations of implementation vs file format itself

Modified:
    lucene/java/trunk/docs/fileformats.html
    lucene/java/trunk/docs/fileformats.pdf
    lucene/java/trunk/src/site/src/documentation/content/xdocs/fileformats.xml

Modified: lucene/java/trunk/docs/fileformats.html
URL: http://svn.apache.org/viewvc/lucene/java/trunk/docs/fileformats.html?rev=761886&r1=761885&r2=761886&view=diff
==============================================================================
--- lucene/java/trunk/docs/fileformats.html (original)
+++ lucene/java/trunk/docs/fileformats.html Sat Apr  4 10:18:05 2009
@@ -2430,12 +2430,23 @@
 <a name="N10873"></a><a name="Limitations"></a>
 <h2 class="boxed">Limitations</h2>
 <div class="section">
-<p>There
-                are a few places where these file formats limit the maximum number of
-                terms and documents to a 32-bit quantity, or to approximately 4
-                billion. This is not today a problem, but, in the long term,
-                probably will be. These should therefore be replaced with either
-                UInt64 values, or better yet, with VInt values which have no limit.
+<p>
+	      When referring to term numbers, Lucene's current
+	      implementation uses a Java <span class="codefrag">int</span>, which means
+	      the maximum number of unique terms in any single index
+	      segment is 2,147,483,648.  This is technically not a
+	      limitation of the index file format, just of Lucene's
+	      current implementation.
+	    </p>
+<p>
+	      Similarly, Lucene uses a Java <span class="codefrag">int</span> to refer
+	      to document numbers, and the index file format uses an
+	      <span class="codefrag">Int32</span> on-disk to store document numbers.
+	      This is a limitation of both the index file format and
+	      the current implementation.  Eventually these should be
+	      replaced with either <span class="codefrag">UInt64</span> values, or
+	      better yet, <span class="codefrag">VInt</span> values which have no
+	      limit.
             </p>
 </div>
 

Modified: lucene/java/trunk/docs/fileformats.pdf
URL: http://svn.apache.org/viewvc/lucene/java/trunk/docs/fileformats.pdf?rev=761886&r1=761885&r2=761886&view=diff
==============================================================================
--- lucene/java/trunk/docs/fileformats.pdf (original)
+++ lucene/java/trunk/docs/fileformats.pdf Sat Apr  4 10:18:05 2009
@@ -619,10 +619,10 @@
 >>
 endobj
 103 0 obj
-<< /Length 1831 /Filter [ /ASCII85Decode /FlateDecode ]
+<< /Length 1795 /Filter [ /ASCII85Decode /FlateDecode ]
  >>
 stream
-GatU4>Ar7S'Roe[csq^]MQ".E@DL;C7>G=;(3b+=qES8l9hZ1"M6c/N5CQ,3n5-!p'Le,%Ur7?Qc?BbH[pK!F?@6&jCCe0u(PuW`<Z$S\f+6T]/)MRpBN<:1/H/c,IJ)p8pOPb<Gd?0XSF2UD5K)h_#-Jt-^9u7gO0@+>G%7Z0:<Y4MAXQ`hG_Vg[>(%0_jJ&=#;Jjf*7F-nYXB+\Ef+OW<*E[k&WfK$lkH'b[@k.mXH%F7mJZRC-gGd\2C]2Gp1C/UtHM1hq&#.`!eDZQjX$Hur>TU+bo1nIV8S\c.Z"oM&>Z=7_^ujgVBQ.?34D`Qmpd,tT*IW3?Z'B*gfBV9i@j':PA!asN)n!"UrLapMB?B"\3HBDcjn"8]UcXAGgDP-dhO-4LNB"4^ZK^D_W,a@P/If]Cb;k='^I[<f>&)2X<jEq2`STfY'\Ms>Ps>oING+;.*"%'n/H(KtYn-a?Ha\(:]+4lYcS\>f49\^?4\`ffXX[6ZNC'>uT+NjcN@qVe\GT&8?W\U$)UN@bX3".n_FH+m7aYT3B$Q0gaop.;"h@)WIQYl7\TE<CprZ&5&?%TSAkTu!$*nA!/?=BiWT6Q[jcO-a].0`'m2SdA"1Hfl^VOJ7:r$L8gT3tLC?,ntp^.H92cks'\/,iR^0uiM`(ka;S@*FNclGSqHT.-#GqMC0L>0Z,Ws?(ObN_!7]_RR)SamCh!Um>j^<%b)$ME3$$45BD<<XA&9s9tt)@_<D&Q':9'>S2.'*ZkO7-t)c*_*W;fqi04ke.1p_k&orN&uN)SQH8bVIh?ojgl-.[@2C^2/+uD*MARdV?6o'6Fr[[LVYjM:KRSS*_iQmK,0&#m&-d[B]n#:D)A1ZY#ZF8[5>oJ(U_qA*u/_=pBBFT>RQD4=W\R+(OW,jko*1T$(9.?-kbT'3R/$B_'<"h29*RGW0<Q<h%0St[4%:K_i&'R@?@LHQ%2=X$H&^9<NX/_F_"]*.VtlqFj,"?I6OW
 @!8?%g=\_m@(R*"[noj6,o[QY[`<C[Pen/njh^Vu,`:KOT]j)tuSC];RZ<G01$Yf9:]#!hPNE6)>$#UCm,75JFEF'!'&lo)MB#*P`BH"M7..rGZA]A=3&gDr/V0.XI_$-_qQRd8+rC,]sM7[Ro8>?Zc,:?STM'bT)2X"CEZh1X[&3_Sia)6:BYsfnd&gsL"*Ic=WAmUu7oOcslRg:9lZ/%"aZ/[*Q4KJGL'@QjjS0YPEC2_?]r:@&@hj[p'[JfXP9EL].^34LV//XngMmFHC+Jcd\qp<9[&q8q&QZTb2-nD^;]'FUL)ht-=&?M>2XD9j'>f5Rbq49d.2:Qtsm%bD.7^on+::^ja0;_GAk.Bs!;04d5rLBRCS3HJU6-\p<,J]:!T,,<j'M0"miSaUcKJ4jC7h<pOBnOUl#21SoB8!Fub;q5u8q7j752q1dEG];*Ufh1n9K0<-,=P5N7XIG_-=cl"(kJ!<L'VT6>j6WRBZl&eSjB$9\h:S1^7E%_bRchM@r0X"SP;0-gM-$h'+B!H(4%E1-.-POGUl%]56*?i^+fo_Jia+/]5?(SWSB0VZ,j8kNOnuc=urg4Z+*CcStchgW&;ai7>q];p>32d+;Rqo^&r0i#?F5TS%-T0X],U*H7(D(lm)pD@$#!D$@Q]dXkpHAJ[TVae>h4>b8'Om0MiiY-aKLT+<Ll%cuRHMj;p'64)eXJ9XYQUaChaR4l,H#>)Ih'RlEQV!Ninp+/&'A8J'NG(g,/LM+`.tDa\lRq#8X:+Gr8#G6=R?Y2b1M2,"qe!_':s5DYjm%:%QSgp^Y#F`_Gp@m/-77AmaeCI!2g<20Hsmluap-&V~>
+GatU4>Ar7S'Roe[d+]2W<_2L^$j5X0$!J$K[s6Xpdg*miD/PN8lJaeBIf)`BbX7fh;A;D0OWo(rGKtCbe8dGDAMs?I_p2$VhTZ\o+t!ddq:?/,o>:7+)`f9d*bW74Q=%?'ju.04O`+:T)/=@=q1G5UB0b5`q9CI25$kAbm+PU(p6[@S_brYOneAasQ$sK;bsbCV\XoPk#/&:2B0B]o?I`HXe"5Z/UOK;s/n91/-e9KF.F5h:W?>-F/,.cnjK.3uO@_8'"1"QIH7t`C?>hGu\`dT)c0ING2@V^2ROE%*#"#@na[HM2'2\Y/&.LJ?)+#We!XhIM)6eK:fn-a/Q/;<-?jJpgd/*4i>0e`85O-)U\Rr%A3q[mBldJ44P9%SuM[%j4mY%(oUH#Jrdcoeo-o5%VRTl!YRWD#lIt>Bs=r>+7XC1jYk;t2o-ue2IPlMNbPYatU2?gf)Q>U-3lokBc::a2nqa2"9,M#dV:D;93B:SID&nU[.ohZOr-cV*=A"K],cJOQX$S01e!gp):-2@kur(r/C#l=/;`;VI<ScBBV0\:W0+-1h[&)CUiI,tD1JCIh%AkaT0&[P`[/;p%cB[%&Qb6(HXG4?R6gSl`,n"IHRr@O,E5SJnnf4hR")f,HBZ*2g\G]Q1()V0MQaW;<jCgsH;+.nO@4?!9C]=jX%8Gudakp\.s%p/LAVo%^GI^o@u>=!OkVhoU=IG]_"LdKQLZil/cS$J8=*3*>MqP#hu5U*9S^e=ak78N!gckDWS=F(fiK4;;%lc-"`dQ6aIZ=TF!bG`@6W=1Ls+X'p@kYnOaekqoI$gn"Jm+<>?+O[(W.<eme)0NOg#M*QLDBB9#/0RbcDIbb_&<586'U^#=N)m@B(7D,fi-c0L(#S,#/*WMS5b6H>Jban99eK^\8c8ckBHn32KT6=4U`oPk(oE%X3&c!NEiWCqi.2N;SRGZW7hl5a.TmqJ.$]C+.;2kPoN`Hh\eZUA;8>/'>A8L
 LMI];cY>8aJh10QT;Aj2ORZ;Hj,`-Uqk^p1I+.%2Jo#6H1](3(l+"A(>'h](=13iZ4[!FdhJia1=ec-bCA`Bd[JaFcU9.kb4.`mANZB5]?A%+p"4jpE&fgJ5#b<SB3<?amdi6:2$iN?]D`+Y1Q>d]Yo).nmKEU?=S.1l(Vp,*T>]WrHm#N>L*[VsO$TOjQZLFD&rFTbDi*$=B=XBU\p`4`JD/Ju$<&gP_"s*]jVrr=@Mj-2W<qW70i:3UV'KXh##DoR?d4l<b/7uhDUg5E:O)OZZg0C2@GK1L7-1LAQ0g7u&4mcjq#/d]E]C?W;"`+@bdM<-TGN]N;7W@lkiQ/uO2QBtKGWOhSPal$I2Ai]_uQBSJN!BaGr=.-gWAoIu1=[0Ws#ML-Tju,4%jG1!u,%=s>V]NZd!ai)=7lp^*V`e,Kc8g`$'Ztj[qM`gLWYs%EO_;q,Sa:'lan]T%l736<UaqnOc;=?c26R;P#U=RQh8k[<p7o/?Hh5UFXbdFd`"`_'Eo:9!f4O>s545k=g^)]&T'0uLKOM_0)Y;_*[_"*VM8("^0Yp+0nBh;@Xrmh!%$nIP2A%=&5n;"iA7g`%-c2I58^`)3.V-TTC)fZ[&qslC`0a^FUGi;9e')Zi>#2N@NGP9%_0*nqaGkd8&P*mmaogBA&)a_H!K(0^ZBN@g&bT=WTK?)D9"-2'(j/\$'-E)/!.=[]6/X(ui5hpV$1?)b/W/_Sn+NkjU:AI^,f"^o&.8*/YPR&T=K`b&Mf&FBf;ENK<djla4EK5BVL$V3Y/"05FlboUBkb6[!)oX!c2~>
 endstream
 endobj
 104 0 obj
@@ -633,283 +633,298 @@
 /Contents 103 0 R
 >>
 endobj
+105 0 obj
+<< /Length 736 /Filter [ /ASCII85Decode /FlateDecode ]
+ >>
+stream
+Gat$u9okbt&A@ZcI,%=5f/D9sZ!2(FXEnQfCbKW9%[^W=E,$NMr;2qdfHb1b.O+F&C]%f$Hor\2A)dIZ0\Df^K,5bg!+X7N#IH_!(kN@H!c5QrhF$;9>uQ91\O^`O%8P"B0GkfZJID])WPI,tB3E?>lgf<##->o8f\OS#'t(9UCIDYE\&TkF0^(ShSd]+V@[u,-:SEiD-)%EpP7,'GTeVMP?!:eg[E+'1d(RS$g/XksI^[a3[2$8m;WDo^3#>>QZHgBA$ZfH3QatNZTAs78l,PKF"$I<t^7bu_+cKp:o\\0#:N!+`:!#&Y<5$FboAdD/9rkYr#Do+],_^?'OOl!iR^QpU.ogG'._"TCC5`*(qMMM15r.U_li6]7VOc:r"D+to7A'/_Xcqr._&,7?QH-sCNUp7NW3160[_cn"IE>FHgqHB>B44WoWIO8b0h^k#1hZVnlHb&dp%OFm9:l;>45k@mdVc^7lV>W#[.cNBQ,+jSQ3:PQ[g/3fOIc9-5rm(%13KBApq[QJ<bH7Q:NQ_-0Y#I\1fTj\,LsCh:WN<eAtVI[h">eFXVPS+GMpTKSu1<^L5'K@8;?6\n'Q\.UC`0%g:R\(g+V<1?G-&Jb]*WXE!lCSh^uqUFu%'N>2(_T@6gCFN@BQ9UEBn;qKRDR^"OR9g_E2oLJb862aXc<"<hAS1.a>'8%J^cX1WlBg9I_Z=(c\o-F^:f@,PoBD!"506jBc6^B%u$Og"~>
+endstream
+endobj
 106 0 obj
+<< /Type /Page
+/Parent 1 0 R
+/MediaBox [ 0 0 612 792 ]
+/Resources 3 0 R
+/Contents 105 0 R
+>>
+endobj
+108 0 obj
 <<
  /Title (\376\377\0\61\0\40\0\111\0\156\0\144\0\145\0\170\0\40\0\106\0\151\0\154\0\145\0\40\0\106\0\157\0\162\0\155\0\141\0\164\0\163)
- /Parent 105 0 R
- /Next 107 0 R
+ /Parent 107 0 R
+ /Next 109 0 R
  /A 9 0 R
 >> endobj
-107 0 obj
+109 0 obj
 <<
  /Title (\376\377\0\62\0\40\0\104\0\145\0\146\0\151\0\156\0\151\0\164\0\151\0\157\0\156\0\163)
- /Parent 105 0 R
- /First 108 0 R
- /Last 111 0 R
- /Prev 106 0 R
- /Next 112 0 R
+ /Parent 107 0 R
+ /First 110 0 R
+ /Last 113 0 R
+ /Prev 108 0 R
+ /Next 114 0 R
  /Count -4
  /A 11 0 R
 >> endobj
-108 0 obj
+110 0 obj
 <<
  /Title (\376\377\0\62\0\56\0\61\0\40\0\111\0\156\0\166\0\145\0\162\0\164\0\145\0\144\0\40\0\111\0\156\0\144\0\145\0\170\0\151\0\156\0\147)
- /Parent 107 0 R
- /Next 109 0 R
+ /Parent 109 0 R
+ /Next 111 0 R
  /A 13 0 R
 >> endobj
-109 0 obj
+111 0 obj
 <<
  /Title (\376\377\0\62\0\56\0\62\0\40\0\124\0\171\0\160\0\145\0\163\0\40\0\157\0\146\0\40\0\106\0\151\0\145\0\154\0\144\0\163)
- /Parent 107 0 R
- /Prev 108 0 R
- /Next 110 0 R
+ /Parent 109 0 R
+ /Prev 110 0 R
+ /Next 112 0 R
  /A 15 0 R
 >> endobj
-110 0 obj
+112 0 obj
 <<
  /Title (\376\377\0\62\0\56\0\63\0\40\0\123\0\145\0\147\0\155\0\145\0\156\0\164\0\163)
- /Parent 107 0 R
- /Prev 109 0 R
- /Next 111 0 R
+ /Parent 109 0 R
+ /Prev 111 0 R
+ /Next 113 0 R
  /A 17 0 R
 >> endobj
-111 0 obj
+113 0 obj
 <<
  /Title (\376\377\0\62\0\56\0\64\0\40\0\104\0\157\0\143\0\165\0\155\0\145\0\156\0\164\0\40\0\116\0\165\0\155\0\142\0\145\0\162\0\163)
- /Parent 107 0 R
- /Prev 110 0 R
+ /Parent 109 0 R
+ /Prev 112 0 R
  /A 19 0 R
 >> endobj
-112 0 obj
+114 0 obj
 <<
  /Title (\376\377\0\63\0\40\0\117\0\166\0\145\0\162\0\166\0\151\0\145\0\167)
- /Parent 105 0 R
- /Prev 107 0 R
- /Next 113 0 R
+ /Parent 107 0 R
+ /Prev 109 0 R
+ /Next 115 0 R
  /A 21 0 R
 >> endobj
-113 0 obj
+115 0 obj
 <<
  /Title (\376\377\0\64\0\40\0\106\0\151\0\154\0\145\0\40\0\116\0\141\0\155\0\151\0\156\0\147)
- /Parent 105 0 R
- /Prev 112 0 R
- /Next 114 0 R
+ /Parent 107 0 R
+ /Prev 114 0 R
+ /Next 116 0 R
  /A 23 0 R
 >> endobj
-114 0 obj
+116 0 obj
 <<
  /Title (\376\377\0\65\0\40\0\120\0\162\0\151\0\155\0\151\0\164\0\151\0\166\0\145\0\40\0\124\0\171\0\160\0\145\0\163)
- /Parent 105 0 R
- /First 115 0 R
- /Last 120 0 R
- /Prev 113 0 R
- /Next 121 0 R
+ /Parent 107 0 R
+ /First 117 0 R
+ /Last 122 0 R
+ /Prev 115 0 R
+ /Next 123 0 R
  /Count -6
  /A 25 0 R
 >> endobj
-115 0 obj
+117 0 obj
 <<
  /Title (\376\377\0\65\0\56\0\61\0\40\0\102\0\171\0\164\0\145)
- /Parent 114 0 R
- /Next 116 0 R
+ /Parent 116 0 R
+ /Next 118 0 R
  /A 27 0 R
 >> endobj
-116 0 obj
+118 0 obj
 <<
  /Title (\376\377\0\65\0\56\0\62\0\40\0\125\0\111\0\156\0\164\0\63\0\62)
- /Parent 114 0 R
- /Prev 115 0 R
- /Next 117 0 R
+ /Parent 116 0 R
+ /Prev 117 0 R
+ /Next 119 0 R
  /A 29 0 R
 >> endobj
-117 0 obj
+119 0 obj
 <<
  /Title (\376\377\0\65\0\56\0\63\0\40\0\125\0\151\0\156\0\164\0\66\0\64)
- /Parent 114 0 R
- /Prev 116 0 R
- /Next 118 0 R
+ /Parent 116 0 R
+ /Prev 118 0 R
+ /Next 120 0 R
  /A 31 0 R
 >> endobj
-118 0 obj
+120 0 obj
 <<
  /Title (\376\377\0\65\0\56\0\64\0\40\0\126\0\111\0\156\0\164)
- /Parent 114 0 R
- /Prev 117 0 R
- /Next 119 0 R
+ /Parent 116 0 R
+ /Prev 119 0 R
+ /Next 121 0 R
  /A 33 0 R
 >> endobj
-119 0 obj
+121 0 obj
 <<
  /Title (\376\377\0\65\0\56\0\65\0\40\0\103\0\150\0\141\0\162\0\163)
- /Parent 114 0 R
- /Prev 118 0 R
- /Next 120 0 R
+ /Parent 116 0 R
+ /Prev 120 0 R
+ /Next 122 0 R
  /A 35 0 R
 >> endobj
-120 0 obj
+122 0 obj
 <<
  /Title (\376\377\0\65\0\56\0\66\0\40\0\123\0\164\0\162\0\151\0\156\0\147)
- /Parent 114 0 R
- /Prev 119 0 R
+ /Parent 116 0 R
+ /Prev 121 0 R
  /A 37 0 R
 >> endobj
-121 0 obj
+123 0 obj
 <<
  /Title (\376\377\0\66\0\40\0\120\0\145\0\162\0\55\0\111\0\156\0\144\0\145\0\170\0\40\0\106\0\151\0\154\0\145\0\163)
- /Parent 105 0 R
- /First 122 0 R
- /Last 125 0 R
- /Prev 114 0 R
- /Next 126 0 R
+ /Parent 107 0 R
+ /First 124 0 R
+ /Last 127 0 R
+ /Prev 116 0 R
+ /Next 128 0 R
  /Count -4
  /A 39 0 R
 >> endobj
-122 0 obj
+124 0 obj
 <<
  /Title (\376\377\0\66\0\56\0\61\0\40\0\123\0\145\0\147\0\155\0\145\0\156\0\164\0\163\0\40\0\106\0\151\0\154\0\145)
- /Parent 121 0 R
- /Next 123 0 R
+ /Parent 123 0 R
+ /Next 125 0 R
  /A 41 0 R
 >> endobj
-123 0 obj
+125 0 obj
 <<
  /Title (\376\377\0\66\0\56\0\62\0\40\0\114\0\157\0\143\0\153\0\40\0\106\0\151\0\154\0\145)
- /Parent 121 0 R
- /Prev 122 0 R
- /Next 124 0 R
+ /Parent 123 0 R
+ /Prev 124 0 R
+ /Next 126 0 R
  /A 43 0 R
 >> endobj
-124 0 obj
+126 0 obj
 <<
  /Title (\376\377\0\66\0\56\0\63\0\40\0\104\0\145\0\154\0\145\0\164\0\141\0\142\0\154\0\145\0\40\0\106\0\151\0\154\0\145)
- /Parent 121 0 R
- /Prev 123 0 R
- /Next 125 0 R
+ /Parent 123 0 R
+ /Prev 125 0 R
+ /Next 127 0 R
  /A 45 0 R
 >> endobj
-125 0 obj
+127 0 obj
 <<
  /Title (\376\377\0\66\0\56\0\64\0\40\0\103\0\157\0\155\0\160\0\157\0\165\0\156\0\144\0\40\0\106\0\151\0\154\0\145\0\163)
- /Parent 121 0 R
- /Prev 124 0 R
+ /Parent 123 0 R
+ /Prev 126 0 R
  /A 47 0 R
 >> endobj
-126 0 obj
+128 0 obj
 <<
  /Title (\376\377\0\67\0\40\0\120\0\145\0\162\0\55\0\123\0\145\0\147\0\155\0\145\0\156\0\164\0\40\0\106\0\151\0\154\0\145\0\163)
- /Parent 105 0 R
- /First 127 0 R
- /Last 133 0 R
- /Prev 121 0 R
- /Next 134 0 R
+ /Parent 107 0 R
+ /First 129 0 R
+ /Last 135 0 R
+ /Prev 123 0 R
+ /Next 136 0 R
  /Count -7
  /A 49 0 R
 >> endobj
-127 0 obj
+129 0 obj
 <<
  /Title (\376\377\0\67\0\56\0\61\0\40\0\106\0\151\0\145\0\154\0\144\0\163)
- /Parent 126 0 R
- /Next 128 0 R
+ /Parent 128 0 R
+ /Next 130 0 R
  /A 51 0 R
 >> endobj
-128 0 obj
+130 0 obj
 <<
  /Title (\376\377\0\67\0\56\0\62\0\40\0\124\0\145\0\162\0\155\0\40\0\104\0\151\0\143\0\164\0\151\0\157\0\156\0\141\0\162\0\171)
- /Parent 126 0 R
- /Prev 127 0 R
- /Next 129 0 R
+ /Parent 128 0 R
+ /Prev 129 0 R
+ /Next 131 0 R
  /A 56 0 R
 >> endobj
-129 0 obj
+131 0 obj
 <<
  /Title (\376\377\0\67\0\56\0\63\0\40\0\106\0\162\0\145\0\161\0\165\0\145\0\156\0\143\0\151\0\145\0\163)
- /Parent 126 0 R
- /Prev 128 0 R
- /Next 130 0 R
+ /Parent 128 0 R
+ /Prev 130 0 R
+ /Next 132 0 R
  /A 58 0 R
 >> endobj
-130 0 obj
+132 0 obj
 <<
  /Title (\376\377\0\67\0\56\0\64\0\40\0\120\0\157\0\163\0\151\0\164\0\151\0\157\0\156\0\163)
- /Parent 126 0 R
- /Prev 129 0 R
- /Next 131 0 R
+ /Parent 128 0 R
+ /Prev 131 0 R
+ /Next 133 0 R
  /A 60 0 R
 >> endobj
-131 0 obj
+133 0 obj
 <<
  /Title (\376\377\0\67\0\56\0\65\0\40\0\116\0\157\0\162\0\155\0\141\0\154\0\151\0\172\0\141\0\164\0\151\0\157\0\156\0\40\0\106\0\141\0\143\0\164\0\157\0\162\0\163)
- /Parent 126 0 R
- /Prev 130 0 R
- /Next 132 0 R
+ /Parent 128 0 R
+ /Prev 132 0 R
+ /Next 134 0 R
  /A 62 0 R
 >> endobj
-132 0 obj
+134 0 obj
 <<
  /Title (\376\377\0\67\0\56\0\66\0\40\0\124\0\145\0\162\0\155\0\40\0\126\0\145\0\143\0\164\0\157\0\162\0\163)
- /Parent 126 0 R
- /Prev 131 0 R
- /Next 133 0 R
+ /Parent 128 0 R
+ /Prev 133 0 R
+ /Next 135 0 R
  /A 64 0 R
 >> endobj
-133 0 obj
+135 0 obj
 <<
  /Title (\376\377\0\67\0\56\0\67\0\40\0\104\0\145\0\154\0\145\0\164\0\145\0\144\0\40\0\104\0\157\0\143\0\165\0\155\0\145\0\156\0\164\0\163)
- /Parent 126 0 R
- /Prev 132 0 R
+ /Parent 128 0 R
+ /Prev 134 0 R
  /A 66 0 R
 >> endobj
-134 0 obj
+136 0 obj
 <<
  /Title (\376\377\0\70\0\40\0\114\0\151\0\155\0\151\0\164\0\141\0\164\0\151\0\157\0\156\0\163)
- /Parent 105 0 R
- /Prev 126 0 R
+ /Parent 107 0 R
+ /Prev 128 0 R
  /A 68 0 R
 >> endobj
-135 0 obj
+137 0 obj
 << /Type /Font
 /Subtype /Type1
 /Name /F3
 /BaseFont /Helvetica-Bold
 /Encoding /WinAnsiEncoding >>
 endobj
-136 0 obj
+138 0 obj
 << /Type /Font
 /Subtype /Type1
 /Name /F5
 /BaseFont /Times-Roman
 /Encoding /WinAnsiEncoding >>
 endobj
-137 0 obj
+139 0 obj
 << /Type /Font
 /Subtype /Type1
 /Name /F6
 /BaseFont /Times-Italic
 /Encoding /WinAnsiEncoding >>
 endobj
-138 0 obj
+140 0 obj
 << /Type /Font
 /Subtype /Type1
 /Name /F1
 /BaseFont /Helvetica
 /Encoding /WinAnsiEncoding >>
 endobj
-139 0 obj
+141 0 obj
 << /Type /Font
 /Subtype /Type1
 /Name /F9
 /BaseFont /Courier
 /Encoding /WinAnsiEncoding >>
 endobj
-140 0 obj
+142 0 obj
 << /Type /Font
 /Subtype /Type1
 /Name /F2
 /BaseFont /Helvetica-Oblique
 /Encoding /WinAnsiEncoding >>
 endobj
-141 0 obj
+143 0 obj
 << /Type /Font
 /Subtype /Type1
 /Name /F7
@@ -918,19 +933,19 @@
 endobj
 1 0 obj
 << /Type /Pages
-/Count 20
-/Kids [6 0 R 53 0 R 70 0 R 72 0 R 74 0 R 76 0 R 78 0 R 80 0 R 82 0 R 84 0 R 86 0 R 88 0 R
90 0 R 92 0 R 94 0 R 96 0 R 98 0 R 100 0 R 102 0 R 104 0 R ] >>
+/Count 21
+/Kids [6 0 R 53 0 R 70 0 R 72 0 R 74 0 R 76 0 R 78 0 R 80 0 R 82 0 R 84 0 R 86 0 R 88 0 R
90 0 R 92 0 R 94 0 R 96 0 R 98 0 R 100 0 R 102 0 R 104 0 R 106 0 R ] >>
 endobj
 2 0 obj
 << /Type /Catalog
 /Pages 1 0 R
- /Outlines 105 0 R
+ /Outlines 107 0 R
  /PageMode /UseOutlines
  >>
 endobj
 3 0 obj
 << 
-/Font << /F3 135 0 R /F5 136 0 R /F1 138 0 R /F6 137 0 R /F9 139 0 R /F2 140 0 R /F7
141 0 R >> 
+/Font << /F3 137 0 R /F5 138 0 R /F1 140 0 R /F6 139 0 R /F9 141 0 R /F2 142 0 R /F7
143 0 R >> 
 /ProcSet [ /PDF /ImageC /Text ] >> 
 endobj
 9 0 obj
@@ -1107,82 +1122,82 @@
 /D [104 0 R /XYZ 85.0 227.4 null]
 >>
 endobj
-105 0 obj
+107 0 obj
 <<
- /First 106 0 R
- /Last 134 0 R
+ /First 108 0 R
+ /Last 136 0 R
 >> endobj
 xref
-0 142
+0 144
 0000000000 65535 f 
-0000049768 00000 n 
-0000049963 00000 n 
-0000050056 00000 n 
+0000050671 00000 n 
+0000050874 00000 n 
+0000050967 00000 n 
 0000000015 00000 n 
 0000000071 00000 n 
 0000001280 00000 n 
 0000001400 00000 n 
 0000001572 00000 n 
-0000050208 00000 n 
+0000051119 00000 n 
 0000001707 00000 n 
-0000050271 00000 n 
+0000051182 00000 n 
 0000001842 00000 n 
-0000050337 00000 n 
+0000051248 00000 n 
 0000001979 00000 n 
-0000050401 00000 n 
+0000051312 00000 n 
 0000002116 00000 n 
-0000050467 00000 n 
+0000051378 00000 n 
 0000002253 00000 n 
-0000050533 00000 n 
+0000051444 00000 n 
 0000002390 00000 n 
-0000050599 00000 n 
+0000051510 00000 n 
 0000002527 00000 n 
-0000050663 00000 n 
+0000051574 00000 n 
 0000002664 00000 n 
-0000050727 00000 n 
+0000051638 00000 n 
 0000002801 00000 n 
-0000050793 00000 n 
+0000051704 00000 n 
 0000002938 00000 n 
-0000050859 00000 n 
+0000051770 00000 n 
 0000003075 00000 n 
-0000050925 00000 n 
+0000051836 00000 n 
 0000003212 00000 n 
-0000050991 00000 n 
+0000051902 00000 n 
 0000003349 00000 n 
-0000051057 00000 n 
+0000051968 00000 n 
 0000003486 00000 n 
-0000051121 00000 n 
+0000052032 00000 n 
 0000003622 00000 n 
-0000051187 00000 n 
+0000052098 00000 n 
 0000003759 00000 n 
-0000051251 00000 n 
+0000052162 00000 n 
 0000003895 00000 n 
-0000051317 00000 n 
+0000052228 00000 n 
 0000004032 00000 n 
-0000051381 00000 n 
+0000052292 00000 n 
 0000004169 00000 n 
-0000051447 00000 n 
+0000052358 00000 n 
 0000004305 00000 n 
-0000051513 00000 n 
+0000052424 00000 n 
 0000004442 00000 n 
-0000051577 00000 n 
+0000052488 00000 n 
 0000004578 00000 n 
 0000005297 00000 n 
 0000005420 00000 n 
 0000005489 00000 n 
-0000051643 00000 n 
+0000052554 00000 n 
 0000005622 00000 n 
-0000051707 00000 n 
+0000052618 00000 n 
 0000005755 00000 n 
-0000051771 00000 n 
+0000052682 00000 n 
 0000005888 00000 n 
-0000051835 00000 n 
+0000052746 00000 n 
 0000006021 00000 n 
-0000051899 00000 n 
+0000052810 00000 n 
 0000006154 00000 n 
-0000051964 00000 n 
+0000052875 00000 n 
 0000006286 00000 n 
-0000052029 00000 n 
+0000052940 00000 n 
 0000006419 00000 n 
 0000008567 00000 n 
 0000008675 00000 n 
@@ -1218,50 +1233,52 @@
 0000039760 00000 n 
 0000041446 00000 n 
 0000041556 00000 n 
-0000043481 00000 n 
-0000052094 00000 n 
-0000043591 00000 n 
-0000043791 00000 n 
-0000044009 00000 n 
-0000044215 00000 n 
-0000044423 00000 n 
-0000044591 00000 n 
-0000044791 00000 n 
-0000044949 00000 n 
-0000045124 00000 n 
-0000045365 00000 n 
+0000043445 00000 n 
+0000043555 00000 n 
+0000044384 00000 n 
+0000053005 00000 n 
+0000044494 00000 n 
+0000044694 00000 n 
+0000044912 00000 n 
+0000045118 00000 n 
+0000045326 00000 n 
 0000045494 00000 n 
-0000045648 00000 n 
-0000045802 00000 n 
-0000045946 00000 n 
-0000046096 00000 n 
-0000046237 00000 n 
-0000046477 00000 n 
-0000046659 00000 n 
-0000046832 00000 n 
-0000047035 00000 n 
-0000047223 00000 n 
-0000047475 00000 n 
-0000047616 00000 n 
-0000047825 00000 n 
-0000048011 00000 n 
-0000048185 00000 n 
-0000048430 00000 n 
-0000048621 00000 n 
-0000048827 00000 n 
-0000048988 00000 n 
-0000049102 00000 n 
-0000049213 00000 n 
-0000049325 00000 n 
-0000049434 00000 n 
-0000049541 00000 n 
-0000049658 00000 n 
+0000045694 00000 n 
+0000045852 00000 n 
+0000046027 00000 n 
+0000046268 00000 n 
+0000046397 00000 n 
+0000046551 00000 n 
+0000046705 00000 n 
+0000046849 00000 n 
+0000046999 00000 n 
+0000047140 00000 n 
+0000047380 00000 n 
+0000047562 00000 n 
+0000047735 00000 n 
+0000047938 00000 n 
+0000048126 00000 n 
+0000048378 00000 n 
+0000048519 00000 n 
+0000048728 00000 n 
+0000048914 00000 n 
+0000049088 00000 n 
+0000049333 00000 n 
+0000049524 00000 n 
+0000049730 00000 n 
+0000049891 00000 n 
+0000050005 00000 n 
+0000050116 00000 n 
+0000050228 00000 n 
+0000050337 00000 n 
+0000050444 00000 n 
+0000050561 00000 n 
 trailer
 <<
-/Size 142
+/Size 144
 /Root 2 0 R
 /Info 4 0 R
 >>
 startxref
-52148
+53059
 %%EOF

Modified: lucene/java/trunk/src/site/src/documentation/content/xdocs/fileformats.xml
URL: http://svn.apache.org/viewvc/lucene/java/trunk/src/site/src/documentation/content/xdocs/fileformats.xml?rev=761886&r1=761885&r2=761886&view=diff
==============================================================================
--- lucene/java/trunk/src/site/src/documentation/content/xdocs/fileformats.xml (original)
+++ lucene/java/trunk/src/site/src/documentation/content/xdocs/fileformats.xml Sat Apr  4
10:18:05 2009
@@ -1802,12 +1802,23 @@
 
         <section id="Limitations"><title>Limitations</title>
 
-            <p>There
-                are a few places where these file formats limit the maximum number of
-                terms and documents to a 32-bit quantity, or to approximately 4
-                billion. This is not today a problem, but, in the long term,
-                probably will be. These should therefore be replaced with either
-                UInt64 values, or better yet, with VInt values which have no limit.
+            <p>
+	      When referring to term numbers, Lucene's current
+	      implementation uses a Java <code>int</code>, which means
+	      the maximum number of unique terms in any single index
+	      segment is 2,147,483,648.  This is technically not a
+	      limitation of the index file format, just of Lucene's
+	      current implementation.
+	    </p>
+	    <p>
+	      Similarly, Lucene uses a Java <code>int</code> to refer
+	      to document numbers, and the index file format uses an
+	      <code>Int32</code> on-disk to store document numbers.
+	      This is a limitation of both the index file format and
+	      the current implementation.  Eventually these should be
+	      replaced with either <code>UInt64</code> values, or
+	      better yet, <code>VInt</code> values which have no
+	      limit.
             </p>
 
         </section>



Mime
View raw message