Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
Received-SPF: pass (nike.apache.org: domain of mohajeri@gmail.com designates
 209.85.214.174 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CAEo-6+T8hALVoTQygdLa0Ly904pagbOHqHSVe1rLJjsvsvwKPw@mail.gmail.com>
References: 
 <CAFZGXH2ZM-QjKmoDGPe5e_pm_xGPdbHEg_iQquTBLOS4FZ+Xeg@mail.gmail.com>
	<CAEo-6+T8hALVoTQygdLa0Ly904pagbOHqHSVe1rLJjsvsvwKPw@mail.gmail.com>
Date: Fri, 10 Apr 2015 06:10:38 -0700
Message-ID: 
 <CAO6JcpiWfrCydbmQxrW=moBAMK8x-fumr=Ja=u6-Na8xOKi=yQ@mail.gmail.com>
Subject: Re: Hadoop or spark
From: Peyman Mohajerian <mohajeri@gmail.com>
To: user@hadoop.apache.org
Content-Type: multipart/alternative; boundary=089e01634b9096f10405135e7d4e

--089e01634b9096f10405135e7d4e
Content-Type: text/plain; charset=UTF-8

There actually is such a discussion, e.g.:
http://www.slideshare.net/sbaltagi/spark-or-hadoop-is-it-an-eitheror-proposition-by-slim-baltagi

You can have a standalone Spark cluster with no dependency on Hadoop.

On Fri, Apr 10, 2015 at 5:47 AM, Shahab Yunus <shahab.yunus@gmail.com>
wrote:

> I hope I am not misunderstanding your question, but I don't think there is
> a comparison between Spark and Hadoop. They are different things.
>
> Hadoop is a platform on which you can run YARN, HBase and even Spark. E.g.
> Cloudera's Hadoop distribution has Spark, HBase, Impala, Pig etc. as part
> of its installation. Spark can run within a Hadoop cluster deployment.
>
> I think a more apt comparison would be something like whether you should
> use regular MapReduce on YARN on Hadoop OR Spark on Hadoop.
>
> An even more direct comparison would be Spark vs. Storm, which has been
> discussed here:
> http://marc.info/?l=hadoop-user&m=140434265901449
>
> Regards,
> Shahab
>
>
>
> On Fri, Apr 10, 2015 at 1:08 AM, Ashutosh Kumar <ashutosh.k78@gmail.com>
> wrote:
>
>> How do I decide whether I should go for Hadoop or Spark for a greenfield
>> project? I tried to find out, and it looks like Spark can do everything that
>> Hadoop can do. I would appreciate your thoughts on it.
>>
>> Thanks
>>
>>
>


--089e01634b9096f10405135e7d4e--