hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Aditya Kumar <adityakumar...@yahoo.com>
Subject Re: Amazon s3 and Hadoop/Hbase
Date Sat, 26 Nov 2011 23:38:32 GMT
Thanks Ram.

More details:
I want to keep the files for now.

when you say Analyses what type of analysis need HBase?
Can you please mention them..
As I am ina   very early state I might not need any other thing than Keeping, BUT if it is
needed for Analysies can you please let me know what it is >

 From: Ramkrishna S Vasudevan <ramkrishna.vasudevan@huawei.com>
To: dev@hbase.apache.org; 'Aditya Kumar' <adityakumar345@yahoo.com> 
Sent: Thursday, November 24, 2011 9:07 PM
Subject: RE: Amazon s3 and Hadoop/Hbase

If your use case is like just store some files particularly if you need only
a big storage medium then S3 should be fine. 

HBase is a database where you want your data to be added from some specific
source like a social site and you want to do some analysis.  


-----Original Message-----
From: Aditya Kumar [mailto:adityakumar345@yahoo.com] 
Sent: Friday, November 25, 2011 9:26 AM
To: dev@hbase.apache.org
Subject: Re: Amazon s3 and Hadoop/Hbase

Thanks Sonal for the excellent reply.

I am looking for a use case:
where in I want to store 10000 files and they will be accessed and modified
and kept back in the S3.

so there will be  modifications of Files on S3 and trying to access them.

For a use case like this do I need Hbase on S3? (or) if I just use S3,
create bucket and keep on adding files and modify them on the need bases

From: Sonal Goyal <sonalgoyal4@gmail.com>
To: dev@hbase.apache.org; Aditya Kumar <adityakumar345@yahoo.com> 
Sent: Thursday, November 24, 2011 7:11 PM
Subject: Re: Amazon s3 and Hadoop/Hbase


S3 is a file storage service. It does provide versioning and key based file
access, but it does not provide any processing paradigm, like Map Reduce
for Hadoop. HBase provides random access to data - its more like a database
in that you can have a key based access for records, and it is a columnar
store. S3 can be compared with HDFS, the hadoop distributed file system,
and in fact one can spin Hadoop clusters off S3 file system.

Maybe if you explain your problem in detail, we can help you further on the
mailing list. You can also read up the fundamentals on the Hadoop and HBase
wiki, there is a high level introduction which can clear the basic doubts.

Hope that helps.

Best Regards,
Crux: Reporting for HBase <https://github.com/sonalgoyal/crux>
Nube Technologies <http://www.nubetech.co>


On Fri, Nov 25, 2011 at 7:10 AM, Aditya Kumar

> Thanks Li Pi.
> so,
> does it mean that s3 is hadoop?
> can you please let me know what I will be missing if I use s3 only with
> out Hbase?
> 2) In s3 there is delete operation, does it really delete the key and
> there by value?
> ________________________________
>  From: Li Pi <li@idle.li>
> To: Aditya Kumar <adityakumar345@yahoo.com>; dev@hbase.apache.org
> Sent: Thursday, November 24, 2011 5:29 PM
> Subject: Re: Amazon s3 and Hadoop/Hbase
> You can run hbase against s3.
> On Nov 24, 2011 5:11 PM, "Aditya Kumar" <adityakumar345@yahoo.com> wrote:
> > Hi,
> > I am looking at Amazon S3.
> > is it like HDFS/Hadoop? .
> > If it is like Hadoop can we add only data to the tables.. than can we
> > Hbase?
> > I tried to read online,
> > all I see that S3 will not delete the data  and will do versioing
> > (something similar to Hadoop)
> >
> > TIA,
> > Adi
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message