hudi-commits mailing list archives

From "Brandon Scheller (Jira)" <j...@apache.org>
Subject [jira] [Created] (HUDI-520) Decide on keyGenerator strategy for handling null/empty recordkeys
Date Sat, 11 Jan 2020 00:52:00 GMT
Brandon Scheller created HUDI-520:
-------------------------------------

             Summary: Decide on keyGenerator strategy for handling null/empty recordkeys 
                 Key: HUDI-520
                 URL: https://issues.apache.org/jira/browse/HUDI-520
             Project: Apache Hudi (incubating)
          Issue Type: Bug
            Reporter: Brandon Scheller


Currently, key-generator implementations write out "__null__" for null values and "__empty__"
for empty values in order to distinguish between the two. This can add extra overhead to
large data lakes that might not use this distinction.

This Jira is to decide on a consistent strategy for handling null/empty record keys in key
generators.

 

The current strategy can be seen within ComplexKeyGenerator.
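
For illustration, the placeholder behaviour described above could be sketched roughly as
follows. This is a minimal standalone sketch, not the actual ComplexKeyGenerator source: the
class name RecordKeySketch, the method buildRecordKey, the "field:value" layout, and the comma
separator are all assumptions made for the example. Only the "__null__" and "__empty__"
placeholders come from the description above.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of a key generator that substitutes placeholders
// for null and empty record key fields. Not the Hudi implementation.
public class RecordKeySketch {
    static final String NULL_PLACEHOLDER = "__null__";
    static final String EMPTY_PLACEHOLDER = "__empty__";

    // Builds a composite key from the given fields. The "field:value"
    // layout and the "," separator are assumptions for illustration.
    static String buildRecordKey(Map<String, String> record, List<String> keyFields) {
        List<String> parts = new ArrayList<>();
        for (String field : keyFields) {
            String value = record.get(field);
            if (value == null) {
                // A missing or null field is written as a 8-character placeholder.
                parts.add(field + ":" + NULL_PLACEHOLDER);
            } else if (value.isEmpty()) {
                // An empty string is kept distinct from null.
                parts.add(field + ":" + EMPTY_PLACEHOLDER);
            } else {
                parts.add(field + ":" + value);
            }
        }
        return String.join(",", parts);
    }

    public static void main(String[] args) {
        Map<String, String> record = new LinkedHashMap<>();
        record.put("id", "42");
        record.put("region", "");
        record.put("tenant", null);
        // Prints the composite key with placeholders in place of the
        // empty and null fields.
        System.out.println(buildRecordKey(record, Arrays.asList("id", "region", "tenant")));
    }
}
```

In a sketch like this, every null or empty key field adds a placeholder string to the stored
record key, which is the per-record overhead the issue refers to.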



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
