lucene-java-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Lucene-java Wiki] Update of "ImproveIndexingSpeed" by MikeMcCandless
Date Thu, 09 Jul 2009 18:55:44 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-java Wiki" for change notification.

The following page has been changed by MikeMcCandless:
http://wiki.apache.org/lucene-java/ImproveIndexingSpeed

The comment on the change is:
Add bullet about getting the bulk-byte-copy optimization to apply when merging.

------------------------------------------------------------------------------
  
   Note that you cannot re-use a single Field instance within a Document, and, you should
not change a Field's value until the Document containing that Field has been added to the
index.  See [http://lucene.apache.org/java/docs/api/org/apache/lucene/document/Field.html
Field] for details.
  
+  * '''Always add fields in the same order to your Document, when using stored fields or
term vectors'''
+ 
+  Lucene's merging has an optimization whereby stored fields and term vectors can be bulk-byte-copied,
but the optimization only applies if the field name -> number mapping is the same across
segments.  Future Lucene versions may attempt to assign the same mapping automatically, but
until then the only way to get the same mapping is to always add the same fields in the same
order to each document you index.
  
   * '''Re-use a single Token instance in your analyzer'''
  

Mime
View raw message