lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Gilberto Rodriguez <gilberto.rodrig...@conviveon.com>
Subject Re: Problem Indexing Large Document Field
Date Wed, 26 May 2004 22:13:02 GMT
Yeap, that was the problem...  I just needed to increase the  
maxFieldLength number.

Thanks...


On May 26, 2004, at 5:56 PM, wallen@Cyveillance.com wrote:

> http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/index/ 
> IndexWrite
> r.html#DEFAULT_MAX_FIELD_LENGTH
>
> maxFieldLength
> public int maxFieldLengthThe maximum number of terms that will be  
> indexed
> for a single field in a document. This limits the amount of memory  
> required
> for indexing, so that collections with very large files will not crash  
> the
> indexing process by running out of memory.
> Note that this effectively truncates large documents, excluding from  
> the
> index terms that occur further in the document. If you know your source
> documents are large, be sure to set this value high enough to  
> accomodate the
> expected size. If you set it to Integer.MAX_VALUE, then the only limit  
> is
> your memory, but you should anticipate an OutOfMemoryError.
>
> By default, no more than 10,000 terms will be indexed for a field.
>
>
>
> -----Original Message-----
> From: Gilberto Rodriguez [mailto:gilberto.rodriguez@conviveon.com]
> Sent: Wednesday, May 26, 2004 4:04 PM
> To: lucene-user@jakarta.apache.org
> Subject: Problem Indexing Large Document Field
>
>
> I am trying to index a field in a Lucene document with about 90,000
> characters. The problem is that it only indexes part of the document.
> It seems to only index about 65,00 characters. So, if I search on terms
> that are at the beginning of the text, the search works, but it fails
> for terms that are at the end of the document.
>
> Is there a limitation on how many characters can be stored in a
> document field? Any help would be appreciated, thanks....
>
>
> Gilberto Rodriguez
> Software Engineer
>    
> 370 CenterPointe Circle, Suite 1178
> Altamonte Springs, FL 32701-3451
>    
> 407.339.1177 (Ext.112) • phone
> 407.339.6704 • fax
> gilberto.rodriguez@conviveon.com • email
> www.conviveon.com • web
>  
> This e-mail contains legally privileged and confidential information
> intended only for the individual or entity named within the message. If
> the reader of this message is not the intended recipient, or the agent
> responsible to deliver it to the intended recipient, the recipient is
> hereby notified that any review, dissemination, distribution or copying
> of this communication is prohibited. If this communication was received
> in error, please notify me by reply e-mail and delete the original
> message.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>
>
Gilberto Rodriguez
Software Engineer
   
370 CenterPointe Circle, Suite 1178
Altamonte Springs, FL 32701-3451
   
407.339.1177 (Ext.112) • phone
407.339.6704 • fax
gilberto.rodriguez@conviveon.com • email
www.conviveon.com • web
 
This e-mail contains legally privileged and confidential information  
intended only for the individual or entity named within the message. If  
the reader of this message is not the intended recipient, or the agent  
responsible to deliver it to the intended recipient, the recipient is  
hereby notified that any review, dissemination, distribution or copying  
of this communication is prohibited. If this communication was received  
in error, please notify me by reply e-mail and delete the original  
message.


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message