hive-user mailing list archives

From Amogh Vasekar <am...@yahoo-inc.com>
Subject Re: Hive vs. Vertica
Date Mon, 19 Oct 2009 12:48:39 GMT
Yahoo! had an MPP framework called Everest, based on columnar storage. I don't know how
popular it was, but it required pretty high-end machines. Zebra, I guess, partially aims at
bringing that into Hadoop using a TFile implementation, and its source is available in contrib.

Amogh


On 10/19/09 10:18 AM, "Arijit Mukherjee" <arijit72@gmail.com> wrote:

I've been in touch with Vertica for the past year. The main concept
behind Vertica is column orientation, which in turn allows a high degree
of compression and faster query processing (mainly when retrieving large
data sets), as it reads only the columns required instead of fetching
the entire row and applying a projection to it. It's Mike Stonebraker's
brainchild and builds on the original C-Store.
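
To make the column-orientation point concrete, here is a minimal Python sketch.
It is only a toy illustration, not Vertica's API or storage format: the sample
data and the helper names (to_columns, run_length_encode) are made up for this
example, and run-length encoding is just one simple scheme that benefits from
having a column's values stored next to each other.

```python
# Hypothetical toy data; names are illustrative only, not Vertica's API.
rows = [
    {"user": "a", "country": "US", "amount": 10},
    {"user": "b", "country": "US", "amount": 20},
    {"user": "c", "country": "US", "amount": 5},
    {"user": "d", "country": "IN", "amount": 7},
]


def to_columns(rows):
    """Pivot a list of row dicts into one list of values per column."""
    columns = {}
    for row in rows:
        for name, value in row.items():
            columns.setdefault(name, []).append(value)
    return columns


def run_length_encode(values):
    """Collapse runs of equal adjacent values into [value, count] pairs."""
    runs = []
    for value in values:
        if runs and runs[-1][0] == value:
            runs[-1][1] += 1
        else:
            runs.append([value, 1])
    return runs


columns = to_columns(rows)

# Row layout: every full row is touched, then projected down to "amount".
row_total = sum(row["amount"] for row in rows)

# Column layout: only the "amount" column is read.
col_total = sum(columns["amount"])

# Adjacent equal values in one column compress into a few runs.
country_runs = run_length_encode(columns["country"])
```

Both totals are the same; the difference is how much data has to be touched to
compute them, which is where a column store saves I/O on wide tables.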

Performance is very impressive on not-so-high-end hardware - loading
is very fast, as are the queries. But, as of now, it does not support
map-reduce. This is supported in Greenplum (another DW datastore) -
but my experience with Greenplum was not so good - performance-wise -
and it needs quite high-end machines.

Arijit

2009/10/17 Edward Capriolo <edlinuxguru@gmail.com>:
> On Sat, Oct 17, 2009 at 1:11 AM, Ryan LeCompte <lecompte@gmail.com> wrote:
>>
>> Guys,
>>
>> I am trying to understand Vertica and how it applies to the Hadoop world. Is
>> this basically a way to store large amounts of data and run SQL-like queries
>> on it that also result in map/reduce jobs in Hadoop/Hive? Or am I trying to
>> compare apples and oranges? If not, are Vertica queries faster in getting
>> results than Hive (5 minutes versus seconds) ?
>>
>> Thanks,
>> Ryan
>>
>>
>
> There was a presentation on Hadoop + Vertica at Hadoop World NYC.
>
> http://www.cloudera.com/hadoop-world-nyc
>
> http://www.cloudera.com/sites/all/themes/cloudera/static/hw09/3%20%20-%204-00%20Omer%20Trajman,%20Vertica,%20Hadoop%20-%20Vertica%20v2.ppt
>
> One major difference is that Vertica is a column-based datastore while Hive
> is row-based by default, but there are many other differences.
>
> Edward
>



--
"And when the night is cloudy,
There is still a light that shines on me,
Shine on until tomorrow, let it be."

