hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andrew Hitchcock" <adpow...@gmail.com>
Subject Re: Pig | Yahoo! Research
Date Thu, 26 Apr 2007 23:53:03 GMT
My understanding is that both Pig and Sawzall are intended for batch
data processing and provide an abstraction layer above MapReduce.

BigTable, on the other hand, is more of a long-term random-access database.

On 4/26/07, Ian Holsman <lists@holsman.net> wrote:
> Jim Kellerman wrote:
> > Can someone comment on how Pig compares with Bigtable?
> >
> > On Thu, 2007-04-26 at 13:10 -0700, Doug Cutting wrote:
> >
> >> FYI
> >>
> >> http://research.yahoo.com/project/pig
> >>
> >> Doug
> >>
> my understanding is
>
> bigtable/hbase stores the data
> mapreduce/hadoop manipulates/creates the data to be stored in bigtable
> via functions, and controls the distribution
> sawzall/pig is a query language to extract information from it. I think
> it would use create functions for mapreduce/hadoop to run.
>
> regards
> Ian
>
>

Mime
View raw message