Date: Fri, 10 Apr 2015 08:47:46 -0400
Subject: Re: Hadoop or spark
From: Shahab Yunus
To: "user@hadoop.apache.org"

I hope I am not misunderstanding your question, but I don't think there is a
direct comparison between Spark and Hadoop. They are different things.

Hadoop is a platform on which you can run YARN, HBase and even Spark. For
example, Cloudera's Hadoop distribution includes Spark, HBase, Impala, Pig,
etc. as part of its installation. Spark can run within a Hadoop cluster
deployment.

I think a more apt comparison would be whether you should use regular
MapReduce on YARN on Hadoop or Spark on Hadoop.

An even more direct comparison would be Spark vs. Storm, which has been
discussed here:
http://marc.info/?l=hadoop-user&m=140434265901449

Regards,
Shahab

On Fri, Apr 10, 2015 at 1:08 AM, Ashutosh Kumar wrote:
> How do I decide whether I should go for Hadoop or Spark for a greenfield
> project? I tried to find out, and it looks like Spark can do everything that
> Hadoop can do. Appreciate your thoughts on it.
>
> Thanks