apex-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ananth (JIRA)" <j...@apache.org>
Subject [jira] [Created] (APEXMALHAR-2472) Implement Kudu Input Operator
Date Sat, 01 Apr 2017 22:13:41 GMT
Ananth created APEXMALHAR-2472:

             Summary: Implement Kudu Input Operator 
                 Key: APEXMALHAR-2472
                 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2472
             Project: Apache Apex Malhar
          Issue Type: Bug
          Components: adapters database
            Reporter: Ananth
            Assignee: Ananth

This operator would allow Kudu to be used as an Input store. This has multiple advantages
like : 

- Ability to solve the dedup problem from entire data set perspective. The dedupe operators
we have today are primarily window based and this might not meet all of the use cases in real
- Ability to selectively stream data based on a SQL expression. Since Kudu is a structural
store, we could effectively allow a SQL expression based "input" definition that would allow
for selective streaming for all downstream operators. This could potentially be an alternative
streaming store pattern as compared to Kafka as Kafka does not allow for selective streaming
of tuples. 

This message was sent by Atlassian JIRA

View raw message