hudi-commits mailing list archives

From GitBox <...@apache.org>
Subject [GitHub] [incubator-hudi] bhasudha commented on a change in pull request #1279: [HUDI-577] update docker demo page and quick start pages
Date Thu, 30 Jan 2020 00:37:39 GMT
bhasudha commented on a change in pull request #1279: [HUDI-577] update docker demo page and quick start pages
URL: https://github.com/apache/incubator-hudi/pull/1279#discussion_r372707127
 
 

 ##########
 File path: docs/_docs/2_3_querying_data.md
 ##########
 @@ -148,5 +148,5 @@ Additionally, `HoodieReadClient` offers the following functionality using Hudi's
 
 ## Presto
 
-Presto is a popular query engine, providing interactive query performance. Presto currently supports only read optimized querying on Hudi tables.
+Presto is a popular query engine, providing interactive query performance. Presto currently supports only read optimized queries on Hudi tables.
 
 Review comment:
   > > Presto currently supports only read optimized queries
   > 
   > this is snapshot query right.. Can you please make another pass for any lines we missed like this
   
   @vinothchandar I looked at this. Here is my understanding; please correct me if I am wrong. A "snapshot" query sees the data committed as of that point in time. For the COW table type, this means the latest Parquet files for each file group. For the MOR table type, a snapshot query sees real-time data merged from the base Parquet file and the delta log files (Avro files). Presto specifically does not support reading from the Avro log files yet (see https://issues.apache.org/jira/browse/HUDI-305), so I kept this as read optimized instead of snapshot. Once that is supported, we can switch this to snapshot query support in Presto. What do you think?
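
To make the distinction concrete, here is a minimal toy sketch of which files each query type would read from a file group. It is not Hudi or Presto code: `FileGroup`, `files_to_read`, the query type strings, and the file names are all hypothetical and only model the description above.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class FileGroup:
    """Toy model of a file group (hypothetical, not a Hudi class)."""
    base_file: str  # latest base Parquet file
    log_files: List[str] = field(default_factory=list)  # Avro delta logs, MOR only


def files_to_read(group: FileGroup, query_type: str) -> List[str]:
    """Return the files a query of the given type would need to read."""
    if query_type == "read_optimized":
        # Only the base Parquet file; delta logs are ignored.
        return [group.base_file]
    if query_type == "snapshot":
        # Base file plus any delta logs that must be merged in.
        return [group.base_file] + list(group.log_files)
    raise ValueError(f"unknown query type: {query_type}")


# Made-up file names for illustration only.
cow_group = FileGroup("fg1_base.parquet")
mor_group = FileGroup("fg2_base.parquet", ["fg2.log.1", "fg2.log.2"])

for name, group in [("COW", cow_group), ("MOR", mor_group)]:
    for query_type in ("read_optimized", "snapshot"):
        print(name, query_type, files_to_read(group, query_type))
```

In this model the two query types read the same files for a COW file group, and differ for MOR only in whether the Avro log files are read, which is the part Presto cannot do yet.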

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services
