flink-dev mailing list archives

From "Fabian Hueske (JIRA)" <j...@apache.org>
Subject [jira] [Created] (FLINK-1444) Add data properties for data sources
Date Sat, 24 Jan 2015 22:21:35 GMT
Fabian Hueske created FLINK-1444:

             Summary: Add data properties for data sources
                 Key: FLINK-1444
                 URL: https://issues.apache.org/jira/browse/FLINK-1444
             Project: Flink
          Issue Type: New Feature
          Components: Java API, JobManager, Optimizer
    Affects Versions: 0.9
            Reporter: Fabian Hueske
            Priority: Minor

This issue proposes to add support for attaching data properties to data sources. These data
properties are defined with respect to input splits.
Possible properties are:

- partitioning across splits: all elements of the same key (combination) are contained in
one split
- sorting / grouping within splits: elements are sorted or grouped on certain keys within a
split
- key uniqueness: a certain key (combination) is unique for all elements of the data source.
This property is not defined with respect to input splits.

The optimizer can leverage this information to generate more efficient execution plans.
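
To make the three properties concrete, here is a minimal sketch that checks each of them on
in-memory data. It is not Flink code and not a proposed API: the class name SplitPropertyCheck,
the method names, and the modelling of input splits as a list of lists are all assumptions made
for illustration only.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.function.Function;

// Hypothetical helper, for illustration only. Each inner list stands for one input split.
public class SplitPropertyCheck {

    // Partitioning across splits: every key occurs in at most one split.
    static <T, K> boolean isPartitioned(List<List<T>> splits, Function<T, K> key) {
        Map<K, Integer> owner = new HashMap<>();
        for (int i = 0; i < splits.size(); i++) {
            for (T e : splits.get(i)) {
                Integer prev = owner.putIfAbsent(key.apply(e), i);
                if (prev != null && prev != i) {
                    return false; // same key seen in two different splits
                }
            }
        }
        return true;
    }

    // Grouping within splits: inside each split, elements with equal keys are adjacent.
    static <T, K> boolean isGroupedWithinSplits(List<List<T>> splits, Function<T, K> key) {
        for (List<T> split : splits) {
            Set<K> seen = new HashSet<>();
            K current = null;
            boolean first = true;
            for (T e : split) {
                K k = key.apply(e);
                if (first || !k.equals(current)) {
                    // A new group starts here; the key must not have had a group before.
                    if (!seen.add(k)) {
                        return false;
                    }
                    current = k;
                    first = false;
                }
            }
        }
        return true;
    }

    // Key uniqueness: no key occurs twice in the whole source, regardless of splits.
    static <T, K> boolean isUnique(List<List<T>> splits, Function<T, K> key) {
        Set<K> seen = new HashSet<>();
        for (List<T> split : splits) {
            for (T e : split) {
                if (!seen.add(key.apply(e))) {
                    return false;
                }
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Two splits; the key is the first character of each element.
        List<List<String>> splits = Arrays.asList(
                Arrays.asList("a1", "a2", "b1"),
                Arrays.asList("c1", "c2"));
        Function<String, Character> firstChar = s -> s.charAt(0);

        System.out.println(isPartitioned(splits, firstChar));         // true
        System.out.println(isGroupedWithinSplits(splits, firstChar)); // true
        System.out.println(isUnique(splits, firstChar));              // false
    }
}
```

In this example the first two properties hold and uniqueness does not, since the keys 'a' and
'c' each occur twice. Note that the uniqueness check never looks at split boundaries, which
matches the remark that this property is not defined with respect to input splits.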

