hudi-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vinoth Chandar (Jira)" <j...@apache.org>
Subject [jira] [Commented] (HUDI-686) Implement BloomIndexV2 that does not depend on memory caching
Date Thu, 19 Mar 2020 21:21:00 GMT

    [ https://issues.apache.org/jira/browse/HUDI-686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062945#comment-17062945
] 

Vinoth Chandar commented on HUDI-686:
-------------------------------------

[~vbalaji] [~shivnarayan] Please review this information closely.. In short, we can support
an indexing option that eliminates memory caching, but not sure if that will outperform current
BloomIndex. Is it worth cleaning this implementation up and checking it in? 

> Implement BloomIndexV2 that does not depend on memory caching
> -------------------------------------------------------------
>
>                 Key: HUDI-686
>                 URL: https://issues.apache.org/jira/browse/HUDI-686
>             Project: Apache Hudi (incubating)
>          Issue Type: Improvement
>          Components: Index, Performance
>            Reporter: Vinoth Chandar
>            Assignee: Vinoth Chandar
>            Priority: Major
>             Fix For: 0.6.0
>
>         Attachments: Screen Shot 2020-03-19 at 10.15.10 AM.png, Screen Shot 2020-03-19
at 10.15.10 AM.png, Screen Shot 2020-03-19 at 10.15.10 AM.png, image-2020-03-19-10-17-43-048.png
>
>
> Main goals here is to provide a much simpler index, without advanced optimizations like
auto tuned parallelism/skew handling but a better out-of-experience for small workloads. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Mime
View raw message