Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id E1D7F200CAE for ; Wed, 7 Jun 2017 04:35:05 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id E095B160BD3; Wed, 7 Jun 2017 02:35:05 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 0B4F5160BC6 for ; Wed, 7 Jun 2017 04:35:04 +0200 (CEST) Received: (qmail 77729 invoked by uid 500); 7 Jun 2017 02:35:04 -0000 Mailing-List: contact user-help@predictionio.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@predictionio.incubator.apache.org Delivered-To: mailing list user@predictionio.incubator.apache.org Received: (qmail 77719 invoked by uid 99); 7 Jun 2017 02:35:04 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 Jun 2017 02:35:04 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 97BA7D0003 for ; Wed, 7 Jun 2017 02:35:03 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.121 X-Spam-Level: X-Spam-Status: No, score=-0.121 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id 4l4pKgXu0MuC for ; Wed, 7 Jun 2017 02:35:02 +0000 (UTC) Received: from mail-ot0-f179.google.com (mail-ot0-f179.google.com [74.125.82.179]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 783455FD9A for ; Wed, 7 Jun 2017 02:35:02 +0000 (UTC) Received: by mail-ot0-f179.google.com with SMTP id k4so273453otd.0 for ; Tue, 06 Jun 2017 19:35:02 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-transfer-encoding; bh=luo5rDP4TzZVrRtkOnZOfIWZElJPqd7NoJ5f2SSDAdU=; b=JOpNv5/m8fMfG/vKuYCq6wuw8/tgWC/I+X6IIZcdxcn6Ea59QrJ0Xbb4jfZOPZyhFx hfIJRMYDgQvvCoOLb7fYgMjbGuKA8WToS/ZMG0wO+2Bkmsh7qWTfnVs3EzAbzAkXeuVk ZlS78yQYDUsLlhQWoaPlyQhsqFtRmgXFMFlTGxmsCkLoXZNshlCPYPc/Tyf6JD31AdpG emv/Y87I9P0nJD2k4KCPQnD8IuOUX01gbHIXk0DnqIsQhrmX16GXRX1nuq4Bm0tZJNMT amtep6V6wLcea+i0c9StLrRCXTEAw7IQQVuWQspYmm/siyIcZtmKqqANrtOBZtKVF+Zd WdsA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:content-transfer-encoding; bh=luo5rDP4TzZVrRtkOnZOfIWZElJPqd7NoJ5f2SSDAdU=; b=PwktepRN2h3WDsKNJcevr2ar2dfhQDaiVFjcZKrP0EIArq7lQVTRSODlEc3j9WLubY Tq51+lvazGikC+a3N/zR4lXb6e5LZYwdtPNP95OD8Otf22SrVGgybWm3N27ZoY+B4P6y JzEvIg961A8GNIMePvTCjLZBMjHT78Esrjr2B2bqPY/U0N7ME16QFwUxOZeQ1StND48s N7mzr9HphRNyn+niUreIAG6utdLxr0hYLbILy8iT1d/iKTmCb2TxSw1hr9vVdGRqvGXR 7xQ57wBBwPV1lFVugR63imd6AXVn7glze4H/x+F2p1TcMSbN7+y8J3b1ZVRkEzFgM4zA 5b+g== X-Gm-Message-State: AODbwcAWz8dFnFd7bJ+R6hWf/uu6QB4/Ahs3Wl5NSXcXhqIPkk6RdCxO r5A6tojJlOhzbCdd5Tk= X-Received: by 10.157.12.156 with SMTP id b28mr14119498otb.94.1496802896464; Tue, 06 Jun 2017 19:34:56 -0700 (PDT) Received: from mail-oi0-f44.google.com (mail-oi0-f44.google.com. [209.85.218.44]) by smtp.gmail.com with ESMTPSA id b67sm205171oif.7.2017.06.06.19.34.55 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 06 Jun 2017 19:34:55 -0700 (PDT) Received: by mail-oi0-f44.google.com with SMTP id h4so189676oib.3 for ; Tue, 06 Jun 2017 19:34:55 -0700 (PDT) X-Received: by 10.202.102.33 with SMTP id a33mr13807292oic.218.1496802895462; Tue, 06 Jun 2017 19:34:55 -0700 (PDT) MIME-Version: 1.0 Received: by 10.74.152.18 with HTTP; Tue, 6 Jun 2017 19:34:54 -0700 (PDT) In-Reply-To: <66905316-87C6-4A6E-AEC0-EFD5E5121B9C@occamsmachete.com> References: <0340760F-5CD7-40C3-82B2-013312F5324E@occamsmachete.com> <66905316-87C6-4A6E-AEC0-EFD5E5121B9C@occamsmachete.com> From: Shinsuke Sugaya Date: Wed, 7 Jun 2017 11:34:54 +0900 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: Update default build targets To: user@predictionio.incubator.apache.org Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable archived-at: Wed, 07 Jun 2017 02:35:06 -0000 I think PIO is difficult to support each corp version policies. For platforms other than major ones, user will be able to build it from source distribution as same as the current situation. I checked some major platforms: AWS(EMR 5.6.0) - Hadoop 2.7.3 - Spark 2.1.1 Cloudera(CDH 5.11) - Hadoop 2.6.0 - Spark 1.6.0/2.1 Release 1 Horton(HDP 2.6) - Hadoop 2.7.3 - Spark 1.6.3/2.1 For Hadoop 2.6 and Spark 2.1, our updated dependencies will work. I agree with Scala 2.11. For Python, major machine-learning packages, such as scikit-learn, still support Python 2.7. I think that Python 2 users exist. Therefore, I prefer PIO to follow major packages if there is not Python version problem. (Of course, I'd like to remove Python 2 support...) Current ES5 support uses REST, not Transport, PIO does not bind to the version of deployed Elasticsearch and also Elasticsearch libraries have a backforward compatibility. If user uses Elasticsearch 5 or the above, PIO will work even if PIO uses the latest ES dependency. So, it doesn't matter if old ES 5.x is deployed. Regards, shinsuke 2017-06-06 23:37 GMT+09:00 Pat Ferrel : > Hmm, I=E2=80=99d rather see our release on versions that are most commonl= y deployed. A systems with so many deps as PIO can run afoul of corp versio= n policies. Then a build for all reasonable deps will take care of edge cas= es. > > py3 and scala 2.11 seem to be the most commonly deployed, not so much for= hadoop 2.7 afaict > > ES5 I agree is problematic. One big reason to move to it is so users can = use SaaS ES, maybe someone in user land can better say what version is out = in SaaS? I=E2=80=99m not concerned with ES release schedule, at issue is ES= adoption level. > > > On Jun 5, 2017, at 9:44 PM, Shinsuke Sugaya w= rote: > >> What is the policy driving dependency upgrades? > > Although it might be difficult to define it, > how about the following policy: > > - Select newer dependencies > - If the above one is not supported in major platforms(ex. AWS, > Cloudera, Horton), change to the lower version > - Review the version of dependencies at every releases if needed > > As for elasticsearch, I would like to keep a newer version > since it will be released monthly. > >> I don=E2=80=99t run hadoop 2.7 locally and many users that have Cloudera= or Horton >> contracts may not either. > > Thank you for the info. > CDH seems not to support hadoop 2.7... > Updated as below: > > > 0.12.0: > - PIO_SCALA_VERSION=3D2.11.8 > - PIO_SPARK_VERSION=3D2.1.1 > - PIO_ELASTICSEARCH_VERSION=3D5.4.1 > - PIO_HADOOP_VERSION=3D2.6.5 > > > For Python 2/3, I fixed some templates, such as recommender. > Since I think template scripts can support both Python 2/3, > I'll fix them. > > Regards, > shinsuke > > 2017-06-06 4:57 GMT+09:00 Pat Ferrel : >> What is the policy driving dependency upgrades? >> >> I don=E2=80=99t run hadoop 2.7 locally and many users that have Cloudera= or Horton >> contracts may not either. Not sure why this should be the default until = it=E2=80=99s >> the most popular of we need some feature of it. >> >> I=E2=80=99d agree with most of what @Shinsuke suggests as long as there = is an easy >> way to build for any reasonable combination of deps. >> >> The hard one will be Python 3. All existing python scripts in templates = will >> need upgrading since it=E2=80=99s very difficult to support mixed py2 an= d py3 where >> scala 2.10 and 2,11 are much easier. I still think it=E2=80=99s time to = do this but >> mention it because with each upgrade we need to consider how may templat= es >> are left even further behind. Many now do not work with Apache PIO, this= may >> put them further behind. >> >> Though we work on PIO we must remember that PIO does nothing interesting >> without templates and ask ourselves what pain we may cause for template >> users. >> >> >> >> On Jun 5, 2017, at 11:06 AM, Donald Szeto wrote: >> >> Hey all, this has a huge impact to the default build, so If you see any >> issue with this, please let us know as soon as possible. >> >> On Sun, Jun 4, 2017 at 10:25 PM, Shinsuke Sugaya >> wrote: >>> >>> Hi all, >>> >>> We have a plan to change default build targets in PIO-83 and PIO-84. >>> Current versions look too old, so it will be better to support >>> newer versions as default. >>> >>> Current: >>> - PIO_SCALA_VERSION=3D2.10.6 >>> - PIO_SPARK_VERSION=3D1.6.3 >>> - PIO_ELASTICSEARCH_VERSION=3D1.7.6 >>> - PIO_HADOOP_VERSION=3D2.6.5 >>> >>> They will be changed to: >>> >>> 0.12.0: >>> - PIO_SCALA_VERSION=3D2.11.8 >>> - PIO_SPARK_VERSION=3D2.1.1 >>> - PIO_ELASTICSEARCH_VERSION=3D5.4.1 >>> - PIO_HADOOP_VERSION=3D2.7.3 >>> >>> Note that this change does not discard old version supports. >>> If you want to use old versions, you can build PIO with them. >>> >>> Please let us know if you have any concerns. >>> >>> https://issues.apache.org/jira/browse/PIO-83 >>> https://issues.apache.org/jira/browse/PIO-84 >>> >>> Regards, >>> shinsuke >> >> >> >