From: Yao Wang
Date: Tue, 7 Nov 2017 14:46:53 -0800
Subject: Data Pipeline Intermediate Representation in MXNet/NNVM
To: dev@mxnet.apache.org

Hi,

TensorFlow has a transform package, https://github.com/tensorflow/transform, which can export a data preprocessing pipeline as a TensorFlow graph that can then be incorporated into the network graph. This package provides a neat way to manage the data pipeline together with the network graph, since these data processing graphs can easily be reused by other developers. Could we also get some performance improvement by using a computation graph for data processing, rather than imperative processing, on large data streams?

Currently in MXNet, if I want to do something similar, I need to pack the code (most of the time a Python script) directly with the network graph files. This approach has some issues:

1. Potential security issues. If I wrote the processing code and I am the only person using it, that's fine. However, if someone else wants to reuse it in their application, they need to review the code to make sure it is safe to run. That makes it hard to reuse.

2. It is bound to a specific language. It's usually easier to develop deep learning applications in Python, but if my production environment doesn't have Python, I need to either set up a Python environment or rewrite the script in a language my production environment supports.

Any thoughts on supporting a data pipeline IR in MXNet/NNVM?
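To make the two issues above concrete, here is a minimal sketch of what a declarative pipeline description could look like. It is only an illustration under stated assumptions: the JSON layout, the op names (`scale`, `shift`, `clip`), and the `run_pipeline` helper are all hypothetical and are not part of MXNet, NNVM, or tensorflow/transform. The pipeline is stored as data rather than as a script, so a consumer only ever runs ops from a fixed whitelist, and the same description could be interpreted by a runtime written in any language.

```python
import json

# Hypothetical whitelist of preprocessing ops. The names and signatures are
# illustrative assumptions, not an existing MXNet/NNVM API.
OPS = {
    "scale": lambda xs, factor: [x * factor for x in xs],
    "shift": lambda xs, offset: [x + offset for x in xs],
    "clip": lambda xs, lo, hi: [min(max(x, lo), hi) for x in xs],
}


def run_pipeline(spec_json, values):
    """Interpret a JSON pipeline description over a list of numbers."""
    spec = json.loads(spec_json)
    for node in spec["nodes"]:
        op = OPS.get(node["op"])
        if op is None:
            # Unknown ops are rejected rather than executed, so a shared
            # pipeline file cannot smuggle in arbitrary code.
            raise ValueError("unsupported op: %s" % node["op"])
        values = op(values, **node.get("attrs", {}))
    return values


# The pipeline itself is plain data and can be shipped next to the
# network graph file.
pipeline = json.dumps({
    "nodes": [
        {"op": "scale", "attrs": {"factor": 0.5}},
        {"op": "shift", "attrs": {"offset": -1.0}},
        {"op": "clip", "attrs": {"lo": 0.0, "hi": 1.0}},
    ]
})

result = run_pipeline(pipeline, [0.0, 2.0, 4.0, 8.0])
print(result)
```

A real IR would presumably express these steps as graph nodes operating on tensors instead of Python lists, but the separation is the same: the shared artifact describes the processing, and each runtime supplies its own trusted implementation of the ops.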