lucene-java-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <>
Subject [Lucene-java Wiki] Update of "ReleaseNote40alpha" by UweSchindler
Date Fri, 29 Jun 2012 12:21:38 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-java Wiki" for change notification.

The "ReleaseNote40alpha" page has been changed by UweSchindler:

add binary terms and IndexReader refactoring

     also sometimes avoid going to disk at all for terms that do not exist. Alternative term
     dictionary implementions are provided and pluggable via the Codec api.
+  * Indexed terms are no longer UTF-16 char sequences, instead terms can be any binary
+    value encoded as byte arrays. By default, text terms are now encoded as UTF-8
+    bytes. Sort order of terms is now defined by their binary value, which is identical
+    to UTF-8 sort order.
   * Substantially faster performance when using a Filter during searching.
   * File-system based directories can rate-limit the IO (MB/sec) of merge
@@ -73, +78 @@

   * Various in-memory data structures such as the term dictionary and FieldCache are represented
     more efficiently with less object overhead.
+  * All search logic is now required to work per segment, IndexReader was therefore refactored
+    differentiate between atomic and composite readers.
   * Lucene 4.0 provides a modular API, consolidating components such as Analyzers and Queries

     that were previously scattered across Lucene core, contrib, and Solr. These modules also
     include additional functionality such as UIMA analyzer integration and a completely reworked

View raw message