From dev-return-36727-archive-asf-public=cust-asf.ponee.io@ignite.apache.org Thu Jul 19 14:29:12 2018 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx-eu-01.ponee.io (Postfix) with SMTP id 13EFB180630 for ; Thu, 19 Jul 2018 14:29:11 +0200 (CEST) Received: (qmail 55035 invoked by uid 500); 19 Jul 2018 12:29:11 -0000 Mailing-List: contact dev-help@ignite.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ignite.apache.org Delivered-To: mailing list dev@ignite.apache.org Received: (qmail 55023 invoked by uid 99); 19 Jul 2018 12:29:10 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 19 Jul 2018 12:29:10 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id F05B618067E for ; Thu, 19 Jul 2018 12:29:09 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.889 X-Spam-Level: * X-Spam-Status: No, score=1.889 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001, T_DKIMWL_WL_MED=-0.01, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id 2iTttkp4togC for ; Thu, 19 Jul 2018 12:29:09 +0000 (UTC) Received: from mail-lj1-f180.google.com (mail-lj1-f180.google.com [209.85.208.180]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 179B95F3B4 for ; Thu, 19 Jul 2018 12:29:09 +0000 (UTC) Received: by mail-lj1-f180.google.com with SMTP id f8-v6so7200652ljk.1 for ; Thu, 19 Jul 2018 05:29:09 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:from:date:message-id:subject:to; bh=+whyX798WCvfL+0/MQpUcKPFHnodJqdmQt/KOCVbZ1Q=; b=B6xj72Td+By2SbTz016OlPW3+3l2EpmE2QhMEV4q2NkipKnl9D2s3/wM3qct2yUtGI 0iBuRUS/uKxYlYNx8UQ7aeM4khR0/p28dlc2nZiQClZMVfTSpkna/WZuSRRId/e+6ZM+ FL+oHynrbPGrCmqs642wtYvdHj4WZroagt+H8Pah98uGFy5K7qUepI9HCR252NHQFM5H bMERNfgQujfRDztlfPQvdORqM/oUp5qiyZrXugeesIiHwVoZ2ZRs266/gMeZWJ8EAo5I lZPkl6JiM0Nt6raH8Yw+yiEnjFRFWK2zFZOMZQytmEx3aZLQHKndoAdo67drVWNPD4RA tFRQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:from:date:message-id:subject:to; bh=+whyX798WCvfL+0/MQpUcKPFHnodJqdmQt/KOCVbZ1Q=; b=Xm//EPItfkcm28APRdSogsqXjR3mfOLdIZNfcqN8zGCiCAmmtfHElJ+I9EKYfidUuf IEXWxA1I07f1+rueuOManUwlWngSR7pkMVEPZ9aumJlqxWULc5naLwhQOOtHNFEpAXlF 2jsG4h4mpEL5vN6YnkQSGaVQA5C6WqBoS74fk2tolpeCePwvasQS0FJRvdrlFWRBEiux 8rWBlpJoCtFQM27qqUfjp9AvqYWjQ/7PFJUyr7KpACe1qz1Emr1CK7zI3TZAcOnV9AYf mof7FAnksTcD0yygtxlH2+YJTB8xE981YM6UuU46yyH9yicEXqUZN2EZexRFhpLe744X QfWg== X-Gm-Message-State: AOUpUlFaxMujiO1OEYFdLiHIlXLJPsbUXsWlpU1zapMRArrIOHBYQzyP 0evXGtJLz3pR4fdwy4/e9Y4LTtq3czUzPrBNjjrhSI2Q X-Google-Smtp-Source: AAOMgpddl/m0k+F0eUNjqvNtKm0r3dfrnh9qcg12TFFgYzVPa01iwRjWl7ev00k/hHt0QPqrraAotHw/ldEL/ak4AEo= X-Received: by 2002:a2e:4103:: with SMTP id o3-v6mr7422992lja.3.1532003342341; Thu, 19 Jul 2018 05:29:02 -0700 (PDT) MIME-Version: 1.0 Received: by 2002:a2e:94c7:0:0:0:0:0 with HTTP; Thu, 19 Jul 2018 05:29:01 -0700 (PDT) From: Alexey Zinoviev Date: Thu, 19 Jul 2018 15:29:01 +0300 Message-ID: Subject: [ML] Machine Learning Pipeline Improvement To: dev@ignite.apache.org Content-Type: multipart/alternative; boundary="00000000000003dc47057159537f" --00000000000003dc47057159537f Content-Type: text/plain; charset="UTF-8" Hi Igniters, I suggest to add and implement by myself sequential pipeline of machine learning operations including all preprocessing stages like Pipeline object in Python library scikit-learn (look here http://scikit-learn.org/stable/modules/generated/sklearn.pipeline.Pipeline.html for the details) It can be combined with current Cross-Validator and Evaluator objects. The possible solution will sequentially apply a list of transforms and a final estimator. Alexey --00000000000003dc47057159537f--