Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id C882D200B50 for ; Fri, 29 Jul 2016 22:13:24 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id C73A6160A79; Fri, 29 Jul 2016 20:13:24 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 1AEED160A6E for ; Fri, 29 Jul 2016 22:13:23 +0200 (CEST) Received: (qmail 12996 invoked by uid 500); 29 Jul 2016 20:13:22 -0000 Mailing-List: contact dev-help@spark.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list dev@spark.apache.org Received: (qmail 12970 invoked by uid 99); 29 Jul 2016 20:13:22 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 29 Jul 2016 20:13:22 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id E808DC0DAB for ; Fri, 29 Jul 2016 20:13:21 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.179 X-Spam-Level: * X-Spam-Status: No, score=1.179 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx2-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id PxsQMLzsgMcZ for ; Fri, 29 Jul 2016 20:13:19 +0000 (UTC) Received: from mail-oi0-f52.google.com (mail-oi0-f52.google.com [209.85.218.52]) by mx2-lw-eu.apache.org (ASF Mail Server at mx2-lw-eu.apache.org) with ESMTPS id 208535F256 for ; Fri, 29 Jul 2016 20:13:19 +0000 (UTC) Received: by mail-oi0-f52.google.com with SMTP id l72so121020659oig.2 for ; Fri, 29 Jul 2016 13:13:19 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=HvkNdpiIZQC89TYKhJdrI8ONwc0JJwzVlYIlFWlLOSU=; b=TF/N6f2mHdVHq60+fblpsbDDEMDHtpmYc1E71D6XKpV6n4278Ntm5cM/zGn7U6kxV2 f8QzPmJMWH5YogcgE8rDscvNA4Id6xQs6ynesjey2Pd56RguIbL535pa8QfsPK6+N41/ hXwMhl/OUv2qtEKn6E6Udh8k6tDKVCpFWyyB+7SMHpI0ZISehmq8way1r3KFM+YpOfG9 n2JanjxcTk4x1N/Kr5NbFDbZfxYim8kFmIBE0dkpVhEieQjS1HsqmRoouX9//8z4SivZ ZmYEtXjbo5cg/PoGtP4ITCNRiFBP8Rp95Jyh7+rIfPmrY19JpqFpPwHqLpj8HO9nj+DI dpSw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=HvkNdpiIZQC89TYKhJdrI8ONwc0JJwzVlYIlFWlLOSU=; b=gyoi2FwtpQCbbU+sTWUTtFh/veL/OEyB5NOxCq5P6E02fvCwmpgTFh/4b62jlt1uMH FQdKuIdgMuORiRNxIyMmvsQHm4Yg7bFUVA4Xif1UqZmi1ZeCo1e5W8dIkeZHu+XHi3Rp IoD7AeUwgyRGH4b/FVj3QWcc1qtTX4pOhvUDzh+7NHBzIUR4T1nh4NkAe9ma7e5lQ+gu NYKALm4DZ5q0hgYekoyZWcNrg7m0KjmgIiVMqf8OKo0M9zowJqMJxIypDpVxftMAcdhc c1NZ4+N9APeA1GJcatBvUp1MRjy7927je1humc5oUwAkfbkey6J8kq/BZ+XcPej8atZU TckA== X-Gm-Message-State: AEkoouv0tMdcT5tMAQwygS4mGyQtrouwY4voNQFbnWcWkfXSIuC6qDX7Fbh24gjVs/F8THmBNf95UarjLZynwg== X-Received: by 10.157.20.4 with SMTP id h4mr28488715oth.42.1469823198025; Fri, 29 Jul 2016 13:13:18 -0700 (PDT) MIME-Version: 1.0 References: In-Reply-To: From: Nicholas Chammas Date: Fri, 29 Jul 2016 20:13:08 +0000 Message-ID: Subject: Re: Clarifying that spark-x.x.x-bin-hadoopx.x.tgz doesn't include Hadoop itself To: Marcelo Vanzin Cc: Spark dev list Content-Type: multipart/alternative; boundary=001a113ba24899dee70538cbe175 archived-at: Fri, 29 Jul 2016 20:13:25 -0000 --001a113ba24899dee70538cbe175 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hmm, perhaps I'm the one who's confused. =F0=9F=A4=94 I thought the person in the linked discussion expected Hadoop itself (i.e. the full application, not just the jars) to somehow be included, but rereading the discussion I may have just misinterpreted them. The Hadoop jars packaged with Spark just allow Spark to interact with Hadoop, or allow it to use the Hadoop API for interacting with systems like S3, right? If you want HDFS, MapReduce, etc. you're obviously not getting that in those Spark packages. Maybe this was already clear to those users and I just injected some confusion into the discussion. Nick On Fri, Jul 29, 2016 at 4:06 PM Marcelo Vanzin wrote: > Why do you say Hadoop is not included? > > The Hadoop jars are there in the tarball, and match the advertised > version. There is (or at least there was in 1.x) a version called > "without-hadoop" which did not include any Hadoop jars. > > On Fri, Jul 29, 2016 at 12:56 PM, Nicholas Chammas > wrote: > > I had an interaction on my project today that suggested some people may > be > > confused about what the packages available on the downloads page are > > actually for. > > > > Specifically, the various -hadoopx.x.tgz packages suggest that Hadoop > itself > > is actually included in the package. I=E2=80=99m not 100% sure myself h= onestly, > but > > as I explained in my comment linked above, I believe the -hadoopx.x.tgz > just > > indicates the version of Hadoop that Spark was built against. > > > > Does it make sense to add a brief note to the downloads page explaining > > this? > > > > I am assuming it would be too disruptive to change the package names to > > something more descriptive like -built-against-hadoopx.x.tgz. > > > > Nick > > > > -- > Marcelo > --001a113ba24899dee70538cbe175 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Hmm, perhaps I'm the one who's confused. =F0=9F=A4= =94

I thought the person in the linked discussion expect= ed Hadoop itself (i.e. the full application, not just the jars) to somehow = be included, but rereading the discussion I may have just misinterpreted th= em.

The Hadoop jars packaged with Spark just allow= Spark to interact with Hadoop, or allow it to use the Hadoop API for inter= acting with systems like S3, right?=C2=A0If= you want HDFS, MapReduce, etc. you're obviously not getting that in th= ose Spark packages.

Maybe this was already = clear to those users and I just injected some confusion into the discussion= .

Nick

<= div dir=3D"ltr">On Fri, Jul 29, 2016 at 4:06 PM Marcelo Vanzin <vanzin@cloudera.com> wrote:
Why do you say Hadoop is not included?

The Hadoop jars are there in the tarball, and match the advertised
version. There is (or at least there was in 1.x) a version called
"without-hadoop" which did not include any Hadoop jars.

On Fri, Jul 29, 2016 at 12:56 PM, Nicholas Chammas
<nichola= s.chammas@gmail.com> wrote:
> I had an interaction on my project today that suggested some people ma= y be
> confused about what the packages available on the downloads page are > actually for.
>
> Specifically, the various -hadoopx.x.tgz packages suggest that Hadoop = itself
> is actually included in the package. I=E2=80=99m not 100% sure myself = honestly, but
> as I explained in my comment linked above, I believe the -hadoopx.x.tg= z just
> indicates the version of Hadoop that Spark was built against.
>
> Does it make sense to add a brief note to the downloads page explainin= g
> this?
>
> I am assuming it would be too disruptive to change the package names t= o
> something more descriptive like -built-against-hadoopx.x.tgz.
>
> Nick



--
Marcelo
--001a113ba24899dee70538cbe175--