Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 42ED21759E for ; Fri, 10 Apr 2015 13:11:55 +0000 (UTC) Received: (qmail 39991 invoked by uid 500); 10 Apr 2015 13:11:50 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 39881 invoked by uid 500); 10 Apr 2015 13:11:50 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 39871 invoked by uid 99); 10 Apr 2015 13:11:50 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Apr 2015 13:11:50 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of mohajeri@gmail.com designates 209.85.214.174 as permitted sender) Received: from [209.85.214.174] (HELO mail-ob0-f174.google.com) (209.85.214.174) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Apr 2015 13:11:25 +0000 Received: by obbeb7 with SMTP id eb7so13387753obb.3 for ; Fri, 10 Apr 2015 06:10:38 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=RYYxuBBN4eTyU4YSyUqJThfqATSrjO328uZhpSqfw4w=; b=ueD+N+QVmCnKv6z+b1cbfA8RhXN8KRZKZANDaHcCoyHwC4iI/p7VH5cRzcYExy1WzI Vn1b7FzmGIFtoxzT8F1V+rY/vYDHVb3EtdlBAVDf23d14fH20qoNxDJtyY1vlH1PhiNt xA+0LrLxKOrNnhIsRdfO02Sox4bH4ErHvFWiNDcgH7hrhRfpDCIcRzNbJ/FqN1bDNvim RtZMxHtGaE9JKAb9T/yL5si9aNSFXHik+8wFhZSc2WWK7qOGFMReMAhD1qIpqpVLvtl8 Lr/HkoU9/i+dE9sw+eihaSuHe1bKAgpYLHUHC1oF451GPGO6M2nDu/UxA+LkuBG90EVr pqIw== MIME-Version: 1.0 X-Received: by 10.182.131.130 with SMTP id om2mr1808178obb.70.1428671438449; Fri, 10 Apr 2015 06:10:38 -0700 (PDT) Received: by 10.76.4.78 with HTTP; Fri, 10 Apr 2015 06:10:38 -0700 (PDT) In-Reply-To: References: Date: Fri, 10 Apr 2015 06:10:38 -0700 Message-ID: Subject: Re: Hadoop or spark From: Peyman Mohajerian To: user@hadoop.apache.org Content-Type: multipart/alternative; boundary=089e01634b9096f10405135e7d4e X-Virus-Checked: Checked by ClamAV on apache.org --089e01634b9096f10405135e7d4e Content-Type: text/plain; charset=UTF-8 There actually is such a discussion, e.g.: http://www.slideshare.net/sbaltagi/spark-or-hadoop-is-it-an-eitheror-proposition-by-slim-baltagi you can have a standalone Spark cluster with no dependency on Hadoop. On Fri, Apr 10, 2015 at 5:47 AM, Shahab Yunus wrote: > I hope I am not misunderstanding your question but I don't think there is > a comparison between Spark and Hadoop. They are different things. > > Hadoop is a platform on which you can run Yarn, HBase and even Spark. E.g. > Cloudera's Hadoop distribution has Spark, Hbase, Impala, Pig etc. as part > of its installation. Spark can run within a Hadoop cluster deployment. > > I think a more apt comparison would be something like whether you should > use regular MapReduce on Yarn on Hadoop OR Spark on Hadoop. > > Or even more direct would be Spark vs. Storm, which has been discussed > here. > http://marc.info/?l=hadoop-user&m=140434265901449 > > Regards, > Shahab > > > > On Fri, Apr 10, 2015 at 1:08 AM, Ashutosh Kumar > wrote: > >> How do I decide whether I should go for Hadoop or Spark for a greenfield >> project . I tried to find out and looks like Spark can do everything that >> hadoop can do. Appreciate your thoughts on it. >> >> Thanks >> >> > --089e01634b9096f10405135e7d4e Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
There actually is such a discussion, e.g.:

you can have a standalone Spark cluster with no dependency on Hadoop.

On Fri, = Apr 10, 2015 at 5:47 AM, Shahab Yunus <shahab.yunus@gmail.com>= wrote:
I hope I = am not misunderstanding your question but I don't think there is a comp= arison between Spark and Hadoop. They are different things.

<= div>Hadoop is a platform on which you can run Yarn, HBase and even Spark. E= .g. Cloudera's Hadoop distribution has Spark, Hbase, Impala, Pig etc. a= s part of its installation. Spark can run within a Hadoop cluster deploymen= t.

I think a more apt comparison would be somethin= g like whether you should use regular MapReduce on Yarn on Hadoop OR Spark = on Hadoop.

Or even more direct would be Spark vs. = Storm, which has been discussed here.

Regards,
Shahab



On Fri, Apr 10, 2015 at 1:08 AM, Ashutosh Kumar <ashutosh.k78@gmail.com> wrote:
How do I decide whether I should go for Hadoop o= r Spark for a greenfield project . I tried to find out and looks like Spark= can do everything that hadoop can do. Appreciate your thoughts on it.
<= br>
Thanks



--089e01634b9096f10405135e7d4e--