hadoop-user mailing list archives

From Jay Vyas <jayunit...@gmail.com>
Subject Re: real time analytics on hadoop using spark or storm
Date Sat, 07 Dec 2013 01:11:43 GMT
Spark increases performance by keeping working data sets in memory across
the cluster (a restricted form of distributed shared memory).

Storm, on the other hand, gives you real-time performance by processing
data as it arrives, tuple by tuple or in small batches, rather than as one
large batch job.

The case for Spark is when you want more sophisticated data processing.

The case for Storm is when you have large volumes of incoming data and you
want to run a process every 1000 records.
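
As a rough illustration of that last case, here is a minimal sketch in plain
Python. It does not use Storm's API; the names batches and run_every_n are
made up for this example, and the batch size of 1000 is just the number from
the sentence above.

```python
from typing import Callable, Iterable, Iterator, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def batches(records: Iterable[T], size: int = 1000) -> Iterator[List[T]]:
    """Group an incoming stream of records into lists of at most `size`."""
    batch: List[T] = []
    for record in records:
        batch.append(record)
        if len(batch) == size:
            yield batch
            batch = []
    # Emit whatever is left once the stream ends.
    if batch:
        yield batch


def run_every_n(records: Iterable[T],
                process: Callable[[List[T]], R],
                size: int = 1000) -> List[R]:
    """Call `process` once per batch of `size` records and collect results."""
    return [process(batch) for batch in batches(records, size)]
```

For example, run_every_n(stream, sum) would produce one total per 1000
records. A real Storm topology would distribute this work across workers;
the sketch only shows the "every N records" idea on a single machine.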

If you want a better comparison, try comparing Spark Streaming with Storm.

On Fri, Dec 6, 2013 at 7:04 PM, Smarty Juice <smarty.juice@gmail.com> wrote:

> can anyone explain what is the clear difference between spark and storm
> what are the use case of storm and spark?
> can it be used without hadoop?
> what are the pros and cons of running with or without hadoop?
> thanks

Jay Vyas
