From dev-return-4280-archive-asf-public=cust-asf.ponee.io@hudi.apache.org Thu Aug 5 22:06:49 2021 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mxout1-he-de.apache.org (mxout1-he-de.apache.org [95.216.194.37]) by mx-eu-01.ponee.io (Postfix) with ESMTPS id 2305518068A for ; Fri, 6 Aug 2021 00:06:49 +0200 (CEST) Received: from mail.apache.org (mailroute1-lw-us.apache.org [207.244.88.153]) by mxout1-he-de.apache.org (ASF Mail Server at mxout1-he-de.apache.org) with SMTP id F18C461F6A for ; Thu, 5 Aug 2021 22:06:47 +0000 (UTC) Received: (qmail 47024 invoked by uid 500); 5 Aug 2021 22:06:47 -0000 Mailing-List: contact dev-help@hudi.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hudi.apache.org Delivered-To: mailing list dev@hudi.apache.org Received: (qmail 46996 invoked by uid 99); 5 Aug 2021 22:06:46 -0000 Received: from spamproc1-he-de.apache.org (HELO spamproc1-he-de.apache.org) (116.203.196.100) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 05 Aug 2021 22:06:46 +0000 Received: from localhost (localhost [127.0.0.1]) by spamproc1-he-de.apache.org (ASF Mail Server at spamproc1-he-de.apache.org) with ESMTP id D56081FF407; Thu, 5 Aug 2021 22:06:45 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamproc1-he-de.apache.org X-Spam-Flag: NO X-Spam-Score: -0.002 X-Spam-Level: X-Spam-Status: No, score=-0.002 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.2, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamproc1-he-de.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-ec2-va.apache.org ([116.203.227.195]) by localhost (spamproc1-he-de.apache.org [116.203.196.100]) (amavisd-new, port 10024) with ESMTP id 6ETv1SZ5ZuQk; Thu, 5 Aug 2021 22:06:44 +0000 (UTC) Received-SPF: Pass (mailfrom) identity=mailfrom; client-ip=209.85.222.182; helo=mail-qk1-f182.google.com; envelope-from=n.siva.b@gmail.com; receiver= Received: from mail-qk1-f182.google.com (mail-qk1-f182.google.com [209.85.222.182]) by mx1-ec2-va.apache.org (ASF Mail Server at mx1-ec2-va.apache.org) with ESMTPS id 7C2F8BD348; Thu, 5 Aug 2021 22:06:44 +0000 (UTC) Received: by mail-qk1-f182.google.com with SMTP id t66so7979307qkb.0; Thu, 05 Aug 2021 15:06:44 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=JNRajTYN6miSM3c2Rpg5dU42phW6nrUwHMB+6HK8XUQ=; b=rOnZk5QfDMZLUCPTLgPnjyd9dhpx0417/zA4UoQBLCrNgLcM9gbJ2v8iqUFBTdPzKC g90SDDvktFhVBNF2n+UEoFFbp9N31jg2xMR10mg9SdhaHmbo+d11DzrBP/r2Js0FmguW CywvL7NWRgNwl7UHCnTyAZInjeobAwQcemwD0XllWFfIQEUDzLDbVDa2v3HzCuKuzS8J 3OEYWLRQcAx3JtwKlLe1xXNanty8rkUuGqGzTvRBgC7NwBrxO009ml6rHCGZrNDmq5ur OhaZAVOguMRS9+5ReJdEvH1FdEB/eaLfJbZa1M28sgh/kTmx/OimlwLJk2Fr+hcfWobn ce0g== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=JNRajTYN6miSM3c2Rpg5dU42phW6nrUwHMB+6HK8XUQ=; b=n2zyYqkNmcqnA8e85qfPOewyZMDVFh5xFK1PfchHzcCO66CB93j4Ke/QtF4EppqvzI mq+ubXZyS/N4MNBWgRfqRxv/4KdI9gdmaHqXd9l4AQVJWe7ZqqPQpdgj425BoxErHyXC cPkzAqmDAhk1PnsM2iL67t8cg1gzqfRP6RmFwS9ydSJURrZ512noGWlUy7G7ypqIfuge 8ialv6KrUUAvfrNgmdxPojsZhzIdeuDIfrLBhe6d1qHr6PbTzYgDfsjd132HWqiQkbK5 DfaPSW44UUvhWM5bHn0lwBayXKXb3FVxJ/MB+X9lqbtTLwbImYjve9DjZrs9154DXccp CmrA== X-Gm-Message-State: AOAM531Qp2orHhMs4b2GdOp9rzzNZp0QsXyfeEr0GN0NZVrIfNqHgiHU N3iql5BqiFYqtoBZ7cL7umnpKs3wvxrHIuJQCFgzml+y74MaaA== X-Google-Smtp-Source: ABdhPJy5gjzJ9bSkRaCuB53zy+GfqBMIput/5BBStaQi6GTwIhbZy+QMeZw4u6Y5UDvtdq3/kic6PnEEGlzPhBWEh04= X-Received: by 2002:a05:620a:20ce:: with SMTP id f14mr7315026qka.367.1628201197735; Thu, 05 Aug 2021 15:06:37 -0700 (PDT) MIME-Version: 1.0 References: In-Reply-To: From: Sivabalan Date: Thu, 5 Aug 2021 18:06:26 -0400 Message-ID: Subject: Re: [DISCUSS] Hudi 0.9.0 Release To: dev@hudi.apache.org Cc: users@hudi.apache.org Content-Type: multipart/alternative; boundary="00000000000003946605c8d7221e" --00000000000003946605c8d7221e Content-Type: text/plain; charset="UTF-8" Yeah, end of next week sounds good. Here are the status updates wrt patches I am involved. Plan to get these in by early next week. - [HUDI-2208] Support Bulk Insert For Spark Sql (Owner: pengzhiwei) - [HUDI-2250] Bulk insert support for tables w/ primary key (Owner: Sivabalan) - [HUDI-1842] Spark Sql Support For The Exists Hoodie Table (Owner: pengzhiwei) - [HUDI-1138] Re-implement marker files via timeline server (Owner: Ethan Guo) - [HUDI-1129] Improving schema evolution support in hudi (Owner: Sivabalan) Mid next week: - [HUDI-2063] Add Doc For Spark Sql (DML and DDL) integration With Hudi (Owner: pengzhiwei) Waiting for reviews. Will try to get it in by early next week. If we couldn't get this in, probably will skip this release. - [HUDI-1763] Fixing honoring of Ordering val in DefaultHoodieRecordPayload.preCombine (Owner: Sivabalan) Removed from release blockers: - [HUDI-1887] Setting default value to false for enabling schema post processor (Owner: Sivabalan) - [HUDI-1850] Fixing read of a empty table but with failed write (Owner: Sivabalan) On Thu, Aug 5, 2021 at 11:17 AM Vinoth Chandar wrote: > Any other thoughts? Love to lock this date down sooner than later. > > Thanks > Vinoth > > On Tue, Aug 3, 2021 at 11:35 PM Udit Mehrotra wrote: > > > Agreed Vinoth. End of next week seems reasonable as a hard deadline for > > cutting the RC. > > > > If anyone thinks otherwise or needs more time, feel free to chime in. > > > > On Tue, Aug 3, 2021 at 8:10 PM Vinoth Chandar wrote: > > > > > Thanks Udit! I propose we set end of next week as a hard deadline for > > > cutting the RC. Any thoughts? > > > > > > A good amount of progress is being made on these blockers, I think. > > > > > > > > > On Tue, Aug 3, 2021 at 5:13 PM Udit Mehrotra > wrote: > > > > > > > Hi Community, > > > > > > > > As we draw close to doing Hudi 0.9.0 release, I am happy to share a > > > summary > > > > of the key features/improvements that would be going in the release > and > > > the > > > > current blockers for everyone's visibility. > > > > > > > > *Highlights* > > > > > > > > - [HUDI-1729] Asynchronous Hive sync and commits cleaning for > Flink > > > > writer > > > > - [HUDI-1738] Detect and emit deleted records for Flink MOR table > > > > streaming read > > > > - [HUDI-1867] Support streaming reads for Flink COW table > > > > - [HUDI-1908] Global index for flink writer > > > > - [HUDI-1788] Support Insert Overwrite with Flink Writer > > > > - [HUDI-2209] Bulk insert for flink writer > > > > - [HUDI-1591] Support querying using non-globbed paths for Hudi > > Spark > > > > DataSource queries > > > > - [HUDI-1591] Partition pruning support for read optimized queries > > via > > > > Hudi Spark DataSource > > > > - [HUDI-1415] Register Hudi Table as a Spark DataSource Table with > > > > metastore. Queries via Spark SQL will be routed through Hudi > > > DataSource > > > > (instead of InputFormat), thus making it more performant due to > > > Spark's > > > > native/optimized readers > > > > - [HUDI-1591] Partition pruning support for snapshot queries via > > Hudi > > > > Spark DataSource > > > > - [HUDI-1658] DML and DDL support via Spark SQL > > > > - [HUDI-1790] Add SqlSource for DeltaStreamer to support backfill > > use > > > > cases: > > > > - [HUDI-251] Add JDBC Source support for DeltaStreamer > > > > - [HUDI-1910] Support Kafka based checkpointing for > > > HoodieDeltaStreamer > > > > - [HUDI-1371] Support metadata based listing for Spark DataSource > > and > > > > Spark SQL > > > > - [HUDI-2013] [HUDI-1717] [HUDI-2089] [HUDI-2016] Improvements to > > > > Metadata based listing > > > > - HUDI-89] Introduce a HoodieConfig/ConfigProperty framework to > > bring > > > > all configs under one roof > > > > - [HUDI-2124] Grafana dashboard for Hudi > > > > - [HUDI-1104] [HUDI-1105] [HUDI-2009] Improvements to Bulk Insert > > via > > > > row writing > > > > - [HUDI-1483] Async clustering for Delta Streamer > > > > - [HUDI-2235] Add virtual key support to Hudi > > > > - [HUDI-1848] Add support for Hive Metastore in Hive-sync-tool > > > > - In addition, there have been significant improvements and bug > > fixes > > > to > > > > improve the overall stability of Flink Hudi integration > > > > > > > > *Current Blockers* > > > > > > > > - [HUDI-2208] Support Bulk Insert For Spark Sql (Owner: > pengzhiwei) > > > > - [HUDI-1256] Follow on improvements to HFile tables for metadata > > > based > > > > listing (Owner: None) > > > > - [HUDI-2063] Add Doc For Spark Sql (DML and DDL) integration With > > > Hudi > > > > (Owner: pengzhiwei) > > > > - [HUDI-1842] Spark Sql Support For The Exists Hoodie Table > (Owner: > > > > pengzhiwei) > > > > - [HUDI-1138] Re-implement marker files via timeline server > (Owner: > > > > Ethan Guo) > > > > - [HUDI-1985] Website redesign implementation (Owner: Vinoth > > > > Govindarajan) > > > > - [HUDI-2232] MERGE INTO fails with table having nested struct > > (Owner: > > > > pengzhiwei) > > > > - [HUDI-1468] incremental read support with clustering (Owner: > > Liwei) > > > > - [HUDI-2250] Bulk insert support for tables w/ primary key > (Owner: > > > > None) > > > > - [HUDI-2222] [SQL] Test catalog integration (Owner: Sagar Sumit) > > > > - [HUDI-2221] [SQL] Functionality testing with Spark 2 (Owner: > Sagar > > > > Sumit) > > > > - [HUDI-1887] Setting default value to false for enabling schema > > post > > > > processor (Owner: Sivabalan) > > > > - [HUDI-1850] Fixing read of a empty table but with failed write > > > (Owner: > > > > Sivabalan) > > > > - [HUDI-2151] Enable defaults for out of box performance (Owner: > > Udit > > > > Mehrotra) > > > > - [HUDI-2119] Ensure the rolled-back instance was previously > synced > > to > > > > the Metadata Table when syncing a Rollback Instant (Owner: > Prashant > > > > Wason) > > > > - [HUDI-1458] Support custom clustering strategies and preserve > > commit > > > > time to support incremental read (Owner: Satish Kotha) > > > > - [HUDI-1763] Fixing honoring of Ordering val in > > > > DefaultHoodieRecordPayload.preCombine (Owner: Sivabalan) > > > > - [HUDI-1129] Improving schema evolution support in hudi (Owner: > > > > Sivabalan) > > > > - [HUDI-2120] [DOC] Update docs about schema in flink sql > > > configuration > > > > (Owner: Xianghu Wang) > > > > - [HUDI-2182] Support Compaction Command For Spark Sql (Owner: > > > > pengzhiwei) > > > > > > > > Please respond to the thread if you think that I have missed > capturing > > > any > > > > of the highlights or blockers for Hudi 0.9.0 release. For the owners > of > > > > these release blockers, can you please provide a specific timeline > you > > > are > > > > willing to commit to for finishing these so we can cut an RC ? > > > > > > > > Thanks, > > > > Udit > > > > > > > > > > -- Regards, -Sivabalan --00000000000003946605c8d7221e--