Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id A3B1A200D43 for ; Tue, 21 Nov 2017 20:29:03 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id A22E2160BFC; Tue, 21 Nov 2017 19:29:03 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id E87E2160BE3 for ; Tue, 21 Nov 2017 20:29:02 +0100 (CET) Received: (qmail 49104 invoked by uid 500); 21 Nov 2017 19:29:02 -0000 Mailing-List: contact user-help@predictionio.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@predictionio.apache.org Delivered-To: mailing list user@predictionio.apache.org Received: (qmail 49095 invoked by uid 99); 21 Nov 2017 19:29:02 -0000 Received: from mail-relay.apache.org (HELO mail-relay.apache.org) (140.211.11.15) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 21 Nov 2017 19:29:02 +0000 Received: from mail-yb0-f172.google.com (mail-yb0-f172.google.com [209.85.213.172]) by mail-relay.apache.org (ASF Mail Server at mail-relay.apache.org) with ESMTPSA id B0EEB1A00C7 for ; Tue, 21 Nov 2017 19:29:01 +0000 (UTC) Received: by mail-yb0-f172.google.com with SMTP id g184so4936776ybg.3 for ; Tue, 21 Nov 2017 11:29:01 -0800 (PST) X-Gm-Message-State: AJaThX67ff3dYf5FMpu5XwUTT5sv4B5V7omPiyzu7IPMiEVqvLI8dLWh a5basyIFSID0wb3uK59mr2pkZEUPJUUboiZrbK8= X-Google-Smtp-Source: AGs4zMauImMSyRHRd/mw6P78uwqxZ1q30ppeK3YM7EZX5ZoUZm4hrfn/usVFu2a18RnY2UjYVS76OOL4wG+nUayhSvg= X-Received: by 10.37.215.71 with SMTP id o68mr11633205ybg.511.1511292540868; Tue, 21 Nov 2017 11:29:00 -0800 (PST) MIME-Version: 1.0 Received: by 10.129.94.87 with HTTP; Tue, 21 Nov 2017 11:28:40 -0800 (PST) In-Reply-To: References: From: Donald Szeto Date: Tue, 21 Nov 2017 11:28:40 -0800 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: pio batchpredict fix & changes To: user@predictionio.apache.org Content-Type: multipart/alternative; boundary="001a114fc8320cde65055e833781" archived-at: Tue, 21 Nov 2017 19:29:03 -0000 --001a114fc8320cde65055e833781 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Hi Mars, Thanks for the PR! I am still reviewing the code change, but at the high level it will take away the ability to run "batchpredict" remotely on a Spark cluster + HDFS/S3 setup, and requires extra steps of downloading input and uploading output files for such setup. It will unlikely scale to much larger dataset. That said, this is a very important and convenient feature. I'll help make it as good as possible. Regards, Donald On Mon, Nov 20, 2017 at 10:21 AM, Mars Hall wrote: > Hi PIO folks! > > Curious to hear from anyone using the new (as of PredictionIO 0.12.0) > batch predict command. > > - Do you use `pio batchpredict`? > - What is your use-case? (Which Engine template or algorithm?) > - Does it work for your use-case? (Details please) > > Batch predict is currently broken for engines using a custom > PersistentModel. I've been working on a fix for this issue: > https://github.com/apache/incubator-predictionio/pull/447 > > Because that changeset alters the way the command works, I'd appreciate > feedback on that pull request. *Especially if you use `pio batchpredict` > today*, please try it out, and post feedback to that Github issue. > > Thanks for your attention =F0=9F=98=84 > > -- > *Mars Hall > Customer Facing Architect > Salesforce Platform / Heroku > San Francisco, California > --001a114fc8320cde65055e833781 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Hi Mars,

Thanks for the PR! I am still = reviewing the code change, but at the high level it will take away the abil= ity to run "batchpredict" remotely on a Spark cluster + HDFS/S3 s= etup, and requires extra steps of downloading input and uploading output fi= les for such setup. It will unlikely scale to much larger dataset.

That said, this is a very important and convenient feature= . I'll help make it as good as possible.

Regar= ds,
Donald

On Mon, Nov 20, 2017 at 10:21 AM, Mars Hall <mars.= hall@salesforce.com> wrote:
Hi PIO folks!

Curious to he= ar from anyone using the new (as of PredictionIO 0.12.0) batch predict comm= and.=C2=A0
  • Do you use `pio batchpredict`?
  • What is= your use-case? (Which Engine template or algorithm?)
  • Does it work = for your use-case? (Details please)
Batch predict is cu= rrently broken for engines using a custom PersistentModel. I've been wo= rking on a fix for this issue:

Because that changeset alters the way the command works, I= 'd appreciate feedback on that pull request.=C2=A0Especially if you = use `pio batchpredict` today, please try it out, and post feedback to t= hat Github issue.

Thanks for your attention =F0=9F= =98=84

--
*Mars Hall
Customer Facin= g Architect
Salesforce Platform / Heroku
San Francisco,= California

--001a114fc8320cde65055e833781--