lucene-java-user mailing list archives

From Erik Hatcher
Subject Re: Unexpected end in indexing HTML file
Date Tue, 20 Jan 2004 00:31:59 GMT
On Jan 19, 2004, at 7:27 PM, Syrén Per wrote:
> Hi all,
> I have a question concerning indexing of HTML files.
> One of the files I'm trying to index has an <input type="image" ...> tag
> that also contains a JavaScript call with a string argument that is
> about 1300 characters long. At this point Lucene seems to stop indexing
> the remaining part of the current document, but it does index the other
> files in the same directory.
> How do I work around this?

Seems unlikely, but IndexWriter.maxFieldLength is set to 10,000 by
default. That is a maximum of 10,000 terms per field; terms beyond that
are not indexed. Is it possible you are exceeding that?

What symptoms lead you to believe it is stopping indexing at that point?
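
One quick way to test the 10,000-term theory is to estimate how many terms the document produces. The sketch below is only a rough check: the class and method names (TermLimitCheck, countTerms, exceedsLimit) are made up for illustration, and it splits on whitespace, which is an assumption and not how a Lucene analyzer tokenizes. Treat the count as a ballpark figure.

```java
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.StringTokenizer;

// Hypothetical helper, not part of Lucene: estimates whether a document's
// text would run past a per-field term limit.
public class TermLimitCheck {

    // The limit mentioned in this thread for IndexWriter.maxFieldLength.
    static final int DEFAULT_MAX_FIELD_LENGTH = 10000;

    // Rough term count. Assumption: terms are whitespace-delimited; a real
    // analyzer may split or drop tokens differently.
    static int countTerms(String text) {
        return new StringTokenizer(text).countTokens();
    }

    // True if the estimated term count is above the given limit.
    static boolean exceedsLimit(String text, int maxFieldLength) {
        return countTerms(text) > maxFieldLength;
    }

    public static void main(String[] args) throws Exception {
        if (args.length == 0) {
            System.err.println("usage: java TermLimitCheck <file>");
            return;
        }
        // Read the whole file as UTF-8 text (encoding is an assumption).
        String text = new String(
                Files.readAllBytes(Paths.get(args[0])), StandardCharsets.UTF_8);
        System.out.println("estimated terms: " + countTerms(text));
        System.out.println("over limit: "
                + exceedsLimit(text, DEFAULT_MAX_FIELD_LENGTH));
    }
}
```

If the estimate is nowhere near 10,000, the field limit is probably not the cause and the HTML parsing step is the next place to look.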

