lucene-java-user mailing list archives

From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject Re: Unexpected end in indexing HTML file
Date Tue, 20 Jan 2004 00:29:29 GMT
Look at the IndexWriter Javadocs.  One of its public fields,
maxFieldLength, caps the number of terms indexed per field; anything
past that limit is silently dropped.  This may also be a problem with
the HTML parser you are using.  You didn't share many details, so I
cannot help more.
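
To check whether the term limit is really the cause, you could count
roughly how many tokens your HTML parser hands to Lucene for that file.
This is only a sketch: the class and method names are made up for
illustration, the 10,000 default is what I recall for
IndexWriter.maxFieldLength in Lucene 1.x (verify it in the Javadocs),
and splitting on whitespace only approximates what an analyzer emits.

```java
import java.util.StringTokenizer;

public class FieldLengthCheck {
    // Assumed default of IndexWriter.maxFieldLength in Lucene 1.x; verify in the Javadocs.
    public static final int ASSUMED_DEFAULT_MAX_FIELD_LENGTH = 10000;

    // Rough token count: splits on whitespace only, so it merely
    // approximates the number of terms a real analyzer would produce.
    public static int countTokens(String text) {
        return new StringTokenizer(text).countTokens();
    }

    // True when the text would have more tokens than the given limit,
    // i.e. when the tail of the field would not be indexed.
    public static boolean exceedsLimit(String text, int maxFieldLength) {
        return countTokens(text) > maxFieldLength;
    }

    public static void main(String[] args) {
        // Hypothetical sample; in practice pass the text extracted by your HTML parser.
        String extracted = "text extracted from the HTML file";
        System.out.println(countTokens(extracted) + " tokens, exceeds limit: "
                + exceedsLimit(extracted, ASSUMED_DEFAULT_MAX_FIELD_LENGTH));
    }
}
```

If the count is well under the limit, the parser is the more likely
culprit.  If it is over, raising the limit on the writer before adding
documents (for example writer.maxFieldLength = Integer.MAX_VALUE) should
help, assuming your Lucene version exposes that field.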

Otis

--- Syrén_Per <Per.Syren@lm.se> wrote:
> Hi all,
> 
> Have a question concerning indexing of HTML files.
> 
> One of the files I'm trying to index has an <input type="image" ...>
> tag that also contains a call to a JavaScript function with a string
> argument that is about 1300 characters long. At this point Lucene
> seems to stop indexing the remaining part of the current document,
> but it does index the other files in the same directory.
> 
> How do I work around this?
> 
> psy 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
> 

