lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Murat Yakici" <>
Subject Re: Term Limit?
Date Sat, 04 Apr 2009 11:55:10 GMT

I assume the total number of documents that you can index is also limited
by Java max int. Is this correct? Is there any way to index documents
beyond this number in a single index?


> I tentatively think you are correct: the file format itself does not
> impose this limitation.
> But in a least a couple places internally, Lucene uses a java int to
> hold the term number, which is actually a limit of 2,147,483,648
> terms.  I'll update fileformats.html for 2.9.

> Mike
> On Sat, Apr 4, 2009 at 2:56 AM, deminix <> wrote:
>> The file format page at the bottom cites that there is a 32 bit limit to
>> term numbers.  I fail to see where in the file formats documentation
>> that is
>> actually true.  Is the bottom of the page simply out of date?  I'm also
>> wondering whether the code may be a limiting factor even if the file
>> formats
>> are ok.
>> Thanks.
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:

Murat Yakici
Department of Computer & Information Sciences
University of Strathclyde
Glasgow, UK
The University of Strathclyde is a charitable body, registered in Scotland,
with registration number SC015263.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message