hudi-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From GitBox <...@apache.org>
Subject [GitHub] [incubator-hudi] bvaradar commented on issue #1384: [SUPPORT] Hudi datastore missing updates for many records
Date Wed, 11 Mar 2020 17:43:26 GMT
bvaradar commented on issue #1384: [SUPPORT] Hudi datastore missing updates for many records
URL: https://github.com/apache/incubator-hudi/issues/1384#issuecomment-597773490
 
 
   @utk-spartan : HUDI-409 is for MOR tables and unrelated to your scenario. Hmmm. COW is
one of the most battle-tested part of Hudi :) and hence it is very surprising. Just to be
clear, You are using hudi 0.5.x and started with a clean dataset. right  ?  I am assuming
these are valid updates and not deletes. Right ? One way to debug is : use the cli to print
out commit stats for commits at file level to check if you are seeing a drop in numWrites
(and other stats numUpdateWrites, numDeletes.... are sane). You may have to write a custom
script to suit your need or work with existing cli commands. 

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

Mime
View raw message