hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Stuti Awasthi <stutiawas...@hcl.com>
Subject RE: Version control of files present in HDFS
Date Tue, 22 Nov 2011 06:42:04 GMT
Well the documents can be in various formats like pdf,doc,xls, video file,ppt etc and can vary
in size from MBs to GBs.
I have read that if the file sizes are going bigger than MBs then it is not good to use Hbase
for storage.

Any Comments

From: Ted Dunning [mailto:tdunning@maprtech.com]
Sent: Tuesday, November 22, 2011 11:43 AM
To: hdfs-user@hadoop.apache.org
Subject: Re: Version control of files present in HDFS

How big is that?
On Mon, Nov 21, 2011 at 9:26 PM, Stuti Awasthi <stutiawasthi@hcl.com<mailto:stutiawasthi@hcl.com>>
wrote:
Hi Ted,
Well in my case document size can be big, which is not good to keep in Hbase. So I rule out
this option.

Thanks

From: Ted Dunning [mailto:tdunning@maprtech.com<mailto:tdunning@maprtech.com>]
Sent: Monday, November 21, 2011 8:57 PM
To: hdfs-user@hadoop.apache.org<mailto:hdfs-user@hadoop.apache.org>
Subject: Re: Version control of files present in HDFS

HDFS is a filesystem that is designed to support map-reduce computation.  As such, the semantics
differ from what SVN or GIT would want to have.

HBase provides versioned values.  That might suffice for your needs.
On Mon, Nov 21, 2011 at 9:58 AM, Stuti Awasthi <stutiawasthi@hcl.com<mailto:stutiawasthi@hcl.com>>
wrote:
Do we have any support from GIT or SVN to attain that ?

-----Original Message-----
From: Harsh J [mailto:harsh@cloudera.com<mailto:harsh@cloudera.com>]
Sent: Monday, November 21, 2011 3:23 PM
To: hdfs-user@hadoop.apache.org<mailto:hdfs-user@hadoop.apache.org>
Subject: Re: Version control of files present in HDFS

HDFS is merely a filesystem today. You'll need to write your own versioning layer on top of
it if you need such a functionality.

On Mon, Nov 21, 2011 at 2:52 PM, Stuti Awasthi <stutiawasthi@hcl.com<mailto:stutiawasthi@hcl.com>>
wrote:
> Hi All,
> I want to provide version control for documents which I keep in HDFS. What I can do to
attain this. Any ideas ?
>
> Regards,
> Stuti Awasthi
>
>
> ::DISCLAIMER::
> ----------------------------------------------------------------------
> -------------------------------------------------
>
> The contents of this e-mail and any attachment(s) are confidential and intended for the
named recipient(s) only.
> It shall not attach any liability on the originator or HCL or its
> affiliates. Any views or opinions presented in this email are solely those of the author
and may not necessarily reflect the opinions of HCL or its affiliates.
> Any form of reproduction, dissemination, copying, disclosure,
> modification, distribution and / or publication of this message
> without the prior written consent of the author of this e-mail is
> strictly prohibited. If you have received this email in error please delete it and notify
the sender immediately. Before opening any mail and attachments please check them for viruses
and defect.
>
> ----------------------------------------------------------------------
> -------------------------------------------------
>



--
Harsh J



Mime
View raw message