hudi-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yanjia Gary Li (Jira)" <>
Subject [jira] [Created] (HUDI-610) Impala nea real time table support
Date Thu, 13 Feb 2020 18:52:00 GMT
Yanjia Gary Li created HUDI-610:

             Summary: Impala nea real time table support
                 Key: HUDI-610
             Project: Apache Hudi (incubating)
          Issue Type: New Feature
            Reporter: Yanjia Gary Li
            Assignee: Yanjia Gary Li

Impala uses the JAVA based module call "frontend" to list all the files to scan and let the
C++ based "backend" to do all the file scanning. 

Merge Avro and Parquet could be difficult because it might need to have a custom merging logic
like RealtimeCompactedRecordReader to be implemented in backend using C++, but I think it
will be doable to have something like RealtimeUnmergedRecordReader which only need some changes
in the frontend. 

This message was sent by Atlassian Jira

View raw message