hudi-commits mailing list archives

From "Yanjia Gary Li (Jira)" <j...@apache.org>
Subject [jira] [Created] (HUDI-610) Impala near real time table support
Date Thu, 13 Feb 2020 18:52:00 GMT
Yanjia Gary Li created HUDI-610:
-----------------------------------

             Summary: Impala near real time table support
                 Key: HUDI-610
                 URL: https://issues.apache.org/jira/browse/HUDI-610
             Project: Apache Hudi (incubating)
          Issue Type: New Feature
            Reporter: Yanjia Gary Li
            Assignee: Yanjia Gary Li


Impala uses a Java-based module called the "frontend" to list all the files to scan, and lets
the C++-based "backend" do all the file scanning.

Merging Avro and Parquet could be difficult because it might require custom merging logic,
like RealtimeCompactedRecordReader, to be implemented in the backend in C++. However, I think
it is doable to have something like RealtimeUnmergedRecordReader, which only needs some changes
in the frontend.
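
The frontend-only idea above can be illustrated with a rough sketch. It is not Impala or
Hudi code: the class UnmergedScanPlanner, the ScanTarget type, and the simplified file
naming (<fileId>_<instant>.parquet for base files, <fileId>_<instant>.log.<version> for
log files) are all assumptions made for illustration. The point is only that an unmerged
read can be planned by listing files, with no record-level merge logic in the scanner.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical frontend-side planner; not part of Impala or Hudi. */
final class UnmergedScanPlanner {

    /** Hypothetical description of one file handed to the backend for scanning. */
    static final class ScanTarget {
        final String fileName;
        final String format; // "PARQUET" for base files, "LOG" for delta log files

        ScanTarget(String fileName, String format) {
            this.fileName = fileName;
            this.format = format;
        }
    }

    private static final String BASE_SUFFIX = ".parquet";

    /** Extracts the instant from an assumed "<fileId>_<instant>.parquet" name. */
    private static String instantOf(String baseFileName) {
        int start = baseFileName.indexOf('_') + 1;
        return baseFileName.substring(start, baseFileName.length() - BASE_SUFFIX.length());
    }

    /**
     * Builds an unmerged plan: the newest base file of each file group plus every
     * log file, each as an independent scan target. Nothing is merged by record key,
     * so a reader may see both the old and the updated version of a record.
     */
    static List<ScanTarget> plan(List<String> fileNames) {
        Map<String, String> latestBase = new TreeMap<>(); // fileId -> newest base file
        List<String> logFiles = new ArrayList<>();

        for (String name : fileNames) {
            if (name.endsWith(BASE_SUFFIX) && name.indexOf('_') > 0) {
                String fileId = name.substring(0, name.indexOf('_'));
                String current = latestBase.get(fileId);
                // Instants are assumed to be fixed-width, so string order is time order.
                if (current == null || instantOf(name).compareTo(instantOf(current)) > 0) {
                    latestBase.put(fileId, name);
                }
            } else if (name.contains(".log.")) {
                // A fuller planner would keep only logs of the latest file slice.
                logFiles.add(name);
            }
            // Anything else (metadata files and so on) is ignored.
        }

        Collections.sort(logFiles);
        List<ScanTarget> targets = new ArrayList<>();
        for (String base : latestBase.values()) {
            targets.add(new ScanTarget(base, "PARQUET"));
        }
        for (String log : logFiles) {
            targets.add(new ScanTarget(log, "LOG"));
        }
        return targets;
    }
}
```

A merged read, by contrast, would need the scanner itself to combine base and log records
by key, which is the part that would have to live in the C++ backend.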



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
