Date: Fri, 3 Jun 2016 19:17:59 +0000 (UTC)
From: "ramkrishna.s.vasudevan (JIRA)"
To: issues@flink.apache.org
Reply-To: dev@flink.apache.org
Subject: [jira] [Commented] (FLINK-3850) Add forward field annotations to DataSet operators generated by the Table API

    [ https://issues.apache.org/jira/browse/FLINK-3850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15314630#comment-15314630 ]

ramkrishna.s.vasudevan commented on FLINK-3850:
-----------------------------------------------

[~fhueske] Is it possible that I could work on this issue, with your guidance and feedback? Let me know what you think.

> Add forward field annotations to DataSet operators generated by the Table API
> -----------------------------------------------------------------------------
>
>                 Key: FLINK-3850
>                 URL: https://issues.apache.org/jira/browse/FLINK-3850
>             Project: Flink
>          Issue Type: Improvement
>          Components: Table API
>            Reporter: Fabian Hueske
>
> The DataSet API features semantic annotations [1] that tell the optimizer which input fields an operator copies unchanged. This information is valuable because the optimizer can infer that certain physical properties, such as partitioning or sorting, are not destroyed by user functions, and can therefore generate more efficient execution plans.
> The Table API is built on top of the DataSet API and generates DataSet programs and code for user-defined functions. Hence, it knows exactly which fields are modified and which are not. We should use this information to automatically generate forward field annotations and attach them to the operators. This can significantly improve the performance of certain jobs.
> [1] https://ci.apache.org/projects/flink/flink-docs-release-1.0/apis/batch/index.html#semantic-annotations

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
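
To make the idea in the issue description concrete, here is a minimal sketch of how a forwarded-fields annotation could be derived from a projection. It is a toy model in plain Python, not Flink or Table API code: the `FieldRef` and `Computed` classes and the `forwarded_fields` function are hypothetical names invented for illustration. The only thing taken from the DataSet API is the annotation string notation described in the semantic annotations documentation [1], where `f0` means "field 0 is copied to the same position" and `f1->f2` means "input field 1 is copied to output field 2".

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass(frozen=True)
class FieldRef:
    """Hypothetical output expression: copies input field `index` unchanged."""
    index: int


@dataclass(frozen=True)
class Computed:
    """Hypothetical output expression: derives a new value, so nothing is forwarded."""
    description: str


Expr = Union[FieldRef, Computed]


def forwarded_fields(projection: List[Expr]) -> str:
    """Build a forwarded-fields string for a projection.

    Each entry in `projection` describes one output field, in order.
    Only plain copies (FieldRef) contribute to the annotation.
    """
    parts = []
    for out_pos, expr in enumerate(projection):
        if isinstance(expr, FieldRef):
            if expr.index == out_pos:
                # Same position in input and output.
                parts.append(f"f{out_pos}")
            else:
                # Copied unchanged, but to a different position.
                parts.append(f"f{expr.index}->f{out_pos}")
    return "; ".join(parts)


# Output 0 copies input 0, output 1 is computed, output 2 copies input 1.
projection = [FieldRef(0), Computed("f1 + f2"), FieldRef(1)]
print(forwarded_fields(projection))
```

In the Java DataSet API, a string of this form is what would be passed to `withForwardedFields(...)` on a single-input operator. How the Table API's code generation would actually collect this information is left open by the issue, so the traversal above is only an assumption about one possible approach.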