hudi-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
Subject [incubator-hudi] branch asf-site updated: [HUDI-543] release notes for 0.5.1 (#1277)
Date Fri, 31 Jan 2020 07:28:05 GMT
This is an automated email from the ASF dual-hosted git repository.

leesf pushed a commit to branch asf-site
in repository

The following commit(s) were added to refs/heads/asf-site by this push:
     new 20ede76  [HUDI-543] release notes for 0.5.1 (#1277)
20ede76 is described below

commit 20ede76c4c79c0804518a4fe148b8fcd48391f5c
Author: leesf <>
AuthorDate: Fri Jan 31 15:27:57 2020 +0800

    [HUDI-543] release notes for 0.5.1 (#1277)
 docs/_pages/ | 41 +++++++++++++++++++++++++++++++++++++++++
 1 file changed, 41 insertions(+)

diff --git a/docs/_pages/ b/docs/_pages/
index a2df353..88a5d3e 100644
--- a/docs/_pages/
+++ b/docs/_pages/
@@ -6,6 +6,47 @@ toc: true
 last_modified_at: 2019-12-30T15:59:57-04:00
+## [Release 0.5.1-incubating]
+### Download Information
+ * Source Release : [Apache Hudi(incubating) 0.5.1-incubating Source Release](
+ * Apache Hudi (incubating) jars corresponding to this release is available [here](;quick~hudi)
+### Release Highlights
+* Dependency Version Upgrades
+    * Upgrade from Spark 2.1.0 to Spark 2.4.4
+    * Upgrade from Avro 1.7.7 to Avro 1.8.2
+    * Upgrade from Parquet 1.8.1 to Parquet 1.10.1
+    * Upgrade from Kafka to Kafka 2.0.0 as a result of updating spark-streaming-kafka
artifact from 0.8_2.11/2.12 to 0.10_2.11/2.12.
+* **IMPORTANT** This version requires your runtime spark version to be upgraded to 2.4+.
+* Hudi now supports both Scala 2.11 and Scala 2.12, please refer to [Build with Scala 2.12](
to build with Scala 2.12.
+Also, the packages hudi-spark, hudi-utilities, hudi-spark-bundle and hudi-utilities-bundle
are changed correspondingly to hudi-spark_{scala_version}, hudi-spark_{scala_version}, hudi-utilities_{scala_version},
hudi-spark-bundle_{scala_version} and hudi-utilities-bundle_{scala_version}.
+Note that scala_version here is one of (2.11, 2.12).
+* With 0.5.1, we added functionality to stop using renames for Hudi timeline metadata operations.
This feature is automatically enabled for newly created Hudi tables. For existing tables,
this feature is turned off by default. Please read this [section](,
before enabling this feature for existing hudi tables.
+To enable the new hudi timeline layout which avoids renames, use the write config "hoodie.timeline.layout.version=1".
Alternatively, you can use "repair overwrite-hoodie-props" to append the line "hoodie.timeline.layout.version=1"
to Note that in any case, upgrade hudi readers (query engines) first with
0.5.1-incubating release before upgrading writer.
+* CLI supports `repair overwrite-hoodie-props` to overwrite the table's
with specified file, for one-time updates to table name or even enabling the new timeline
layout above. Note that few queries may temporarily fail while the overwrite happens (few
+* DeltaStreamer CLI parameter for capturing table type is changed from --storage-type to
--table-type. Refer to [wiki](
with more latest terminologies.
+* Configuration Value change for Kafka Reset Offset Strategies. Enum values are changed from
LARGEST to LATEST, SMALLEST to EARLIEST for configuring Kafka reset offset strategies with
configuration(auto.offset.reset) in deltastreamer.
+* When using spark-shell to give a quick peek at Hudi, please provide `--packages org.apache.spark:spark-avro_2.11:2.4.4`,
more details would refer to [latest quickstart docs](
+* Key generator moved to separate package under org.apache.hudi.keygen. If you are using
overridden key generator classes (configuration ("hoodie.datasource.write.keygenerator.class"))
that comes with hudi package, please ensure the fully qualified class name is changed accordingly.
+* Hive Sync tool will register RO tables for MOR with a _ro suffix, so query with _ro suffix.
You would use `--skip-ro-suffix` in sync config in sync config to retain the old naming without
the _ro suffix.
+* With 0.5.1, hudi-hadoop-mr-bundle which is used by query engines such as presto and hive
includes shaded avro package to support hudi real time queries through these engines. Hudi
supports pluggable logic for merging of records. Users provide their own implementation of
+If you are using this feature, you need to relocate the avro dependencies in your custom
record payload class to be consistent with internal hudi shading. You need to add the following
relocation when shading the package containing the record payload implementation.
+ ```xml
+    <pattern>org.apache.avro.</pattern>
+    <shadedPattern></shadedPattern>
+ ```
+ * Better delete support in DeltaStreamer, please refer to [blog](
for more info.
+ * Support for AWS Database Migration Service(DMS) in DeltaStreamer, please refer to [blog](
for more info.
+ * Support for DynamicBloomFilter. This is turned off by default, to enable the DynamicBloomFilter,
please use the index config "hoodie.bloom.index.filter.type=DYNAMIC_V0".
+### Raw Release Notes
+ The raw release notes are available [here](
 ## [Release 0.5.0-incubating](
 ### Download Information

View raw message