lucene-commits mailing list archives

From Apache Wiki <>
Subject [Lucene-java Wiki] Update of "ReleaseNote77" by jimczi
Date Thu, 07 Feb 2019 11:41:20 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-java Wiki" for change notification.

The "ReleaseNote77" page has been changed by jimczi:

Update the release notes for 7.7.0

- ## page was copied from ReleaseNote76
- = 14 December 2018, Apache Lucene™ 7.6.0 available =
+ = XX February 2019, Apache Lucene™ 7.7.0 available =
- The Lucene PMC is pleased to announce the release of Apache Lucene 7.6.0.
+ The Lucene PMC is pleased to announce the release of Apache Lucene 7.7.0.
  Apache Lucene is a high-performance, full-featured text search engine library written entirely
in Java. It is a technology suitable for nearly any application that requires full-text search,
especially cross-platform.
@@ -11, +10 @@
  Please read CHANGES.txt for a full list of new features and changes:
- == Lucene 7.6.0 Release Highlights: ==
+ == Lucene 7.7.0 Release Highlights: ==
  === Bug Fixes ===
-  * Index sorting corruption due to numeric overflow has been fixed. Indices affected by
this bug can be detected by running the CheckIndex command on a 7.6+ release distribution.
-  * Better tessellation processing of Polygons including graceful exceptions for detecting
invalid shapes.
+ * Fix LatLonShape WITHIN queries that fail with Multiple search Polygons that share the
+ * LatLonShape's within and disjoint queries can return false positives with indexed multi-shapes.
  === Improvements ===
-  * Points codec now supports selective indexing; the ability to designate dimensions as
as "data only" dimensions that do not affect construction of the index.
-  * New Simple WKT Shape Parser builds lucene geometries (polygons, lines, rectangles) from
WKT format.
-  * New LatLonShapeLineQuery queries indexed shapes with arbitrary lines.
-  * Performance in PerFieldMergeState#FilterFieldInfos has been improved from O(N) to O(1)
lookup time.
+ * ExitableDirectoryReader may now time out queries that run on points such as range queries
or geo queries.
+ * LUCENE-8527: StandardTokenizer and UAX29URLEmailTokenizer now support Unicode 9.0,
+   and provide Unicode UTS#51 v11.0 Emoji tokenization with the "<EMOJI>" token type.
+ * TopFieldCollector can now early-terminate queries when sorting by SortField.DOC.
+ * Speed up merging segments of points with data dimensions by only sorting on the indexed
+ * The KoreanTokenizer no longer splits unknown words on combining diacritics and detects
script boundaries more accurately with Character#UnicodeScript#of.
+ * Change LatLonShape encoding to use 4 bytes per dimension.
+ * BufferedUpdates now uses optimized storage for buffering doc values updates that
+   can save up to 80% of the heap used by the previous implementation and uses
+   non-object-based data structures. (Simon Willnauer, Mike McCandless, Shai Erera, Adrien Grand)
+ * Moving to the default accepted overhead ratio for packed ints in DocValuesFieldUpdates
yields an up to 4x performance improvement when applying doc values updates.
+ * Doc-value updates get applied faster by sorting with quicksort, which needs to perform
fewer swaps than an in-place mergesort.
+ * Decrease I/O pressure when merging high dimensional points.
- Further details of changes are available in the change log available at:
+ Further details of changes are available in the change log at:
  Please report any feedback to the mailing lists (
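
The KoreanTokenizer item in the highlights refers to Character#UnicodeScript#of, which is a JDK method rather than a Lucene one. As a rough illustration of how script boundaries can be found with it, here is a minimal sketch. It is not the KoreanTokenizer implementation: the class ScriptRuns and the method splitByScript are hypothetical names, and the choice to let COMMON and INHERITED code points (punctuation, combining diacritics) stay attached to the preceding run is an assumption made for this example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not part of Lucene.
class ScriptRuns {

    // Splits text into runs wherever the Unicode script changes.
    static List<String> splitByScript(String text) {
        List<String> runs = new ArrayList<>();
        int start = 0;
        Character.UnicodeScript current = null;
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            Character.UnicodeScript script = Character.UnicodeScript.of(cp);
            // Assumption: COMMON (digits, punctuation, spaces) and INHERITED
            // (combining marks) never start a new run on their own.
            boolean neutral = script == Character.UnicodeScript.COMMON
                    || script == Character.UnicodeScript.INHERITED;
            if (!neutral) {
                if (current != null && script != current) {
                    runs.add(text.substring(start, i));
                    start = i;
                }
                current = script;
            }
            i += Character.charCount(cp); // step over surrogate pairs correctly
        }
        if (start < text.length()) {
            runs.add(text.substring(start));
        }
        return runs;
    }

    public static void main(String[] args) {
        // "Lucene" followed by three Hangul syllables (U+D55C U+AD6D U+C5B4).
        List<String> runs = splitByScript("Lucene\uD55C\uAD6D\uC5B4");
        System.out.println(runs.size() + " runs: " + runs);
    }
}
```

In this sketch a combining acute accent (U+0301) after a Latin letter has script INHERITED, so it does not trigger a split, while a change from LATIN to HANGUL does.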
