Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
Received-SPF: pass (nike.apache.org: domain of wellington.chevreuil@gmail.com
 designates 209.85.216.174 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CAOaQLPMcNnwLDSDnWF5Gh7Ef+hfTk0+8_NVG4SrqyK-E37fxQw@mail.gmail.com>
References: 
 <CAOaQLPMcNnwLDSDnWF5Gh7Ef+hfTk0+8_NVG4SrqyK-E37fxQw@mail.gmail.com>
Date: Tue, 19 Feb 2013 21:27:38 +0000
Message-ID: 
 <CAGScPGvLAQyuN+=P9zQyDjJdUF9KB5-MnpkGPAMpdROnp4vNhA@mail.gmail.com>
Subject: Re: Newbie: HBase good for Tree like structure?
From: Wellington Chevreuil <wellington.chevreuil@gmail.com>
To: user@hadoop.apache.org
Content-Type: multipart/alternative; boundary=047d7b676e6ec42ea504d61a8251

--047d7b676e6ec42ea504d61a8251
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

Hi Jos=E9,

I think your structure is ok to define HBase row keys. The main issue
you`ll have then is row you`ll be able to build these keys, so that you can
properly access your tree nodes.

Regarding your scalability concerns, you should not worry to start with a
small Hadoop/Hbase cluster (even standalone) for development/concept proof
purposes, but that definitely will require a more robust environment if you
get to a billion of rows later. You'll have to start thinking on read/write
load patterns, so that you'll be able to take the best advantage of HBase
as your problem solution.

Regards,
Wellington.

2013/2/19 Jos=E9 Feiteirinha <j@feiteira.org>

> Dear all,
>
> I hope this is the right place for this question.
>
> I'm currently in the starting stages of developing a software that may
> 'explode' in terms of users and data. I'm considering a very basic
> tree-like data-structure and would like to know your thoughts regarding
> HBase/Hadoop.
>
> My reason is that I would like to be prepared from the get-go for large
> data.
>
> My structure is planned as such:
>
>    - The data be nodes of a huge multidimensional tree.
>    - I'm planning on having each row containing the full node path, e.g.
>    "root.grandparentX.parentY.babyZ" (or ? "babyZ.parentY.grandparentX.ro=
ot" )
>    - However in terms of data per node, it should be pretty much static.
>
>
> While this is a very simple structure, it does seem to be beneficial to
> use HBase / Hadoop just for the scalability alone. I also understood that
> if I get to billions of rows, only an HBase like approach can sustain me?
>
> My idea is to start with a simple standalone server and then expand the
> cluster as the load & data grow.
>
> If you may,
> I would like your thoughts, mostly regarding weather I'm using an Hammer
> to kill Ants, my proposed data-structure or any other advice you may have=
.
>
>
> Kind regards,
> Jos=E9
>
> --
> Jos=E9 Feiteirinha
>
> www.feiteira.org
>

--047d7b676e6ec42ea504d61a8251
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

Hi Jos=E9,<div><br></div><div>I think your structure is ok to define HBase =
row keys. The main issue you`ll have then is row you`ll be able to build th=
ese keys, so that you can properly access your tree nodes.</div><div><br></=
div>
<div>Regarding your scalability concerns, you should not worry to start wit=
h a small Hadoop/Hbase cluster (even standalone) for development/concept pr=
oof purposes, but that definitely will require a more robust environment if=
 you get to a billion of rows later. You&#39;ll have to start thinking on r=
ead/write load patterns, so that you&#39;ll be able to take the best advant=
age of HBase as your problem solution.</div>
<div><br></div><div>Regards,</div><div>Wellington.=A0<br><br><div class=3D"=
gmail_quote">2013/2/19 Jos=E9 Feiteirinha <span dir=3D"ltr">&lt;<a href=3D"=
mailto:j@feiteira.org" target=3D"_blank">j@feiteira.org</a>&gt;</span><br><=
blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px=
 #ccc solid;padding-left:1ex">
<font face=3D"tahoma,sans-serif">Dear all,</font><div><font face=3D"tahoma,=
sans-serif"><br></font></div><div><font face=3D"tahoma,sans-serif">I hope t=
his is the right place for this question.</font></div><div><font face=3D"ta=
homa,sans-serif"><br>

</font></div><div><span style=3D"font-family:tahoma,sans-serif">I&#39;m cur=
rently in the starting stages of developing a software that may &#39;explod=
e&#39; in terms of users and data. I&#39;m considering a very basic tree-li=
ke data-structure and would like to know your thoughts regarding HBase/Hado=
op.</span></div>

<div><font face=3D"tahoma,sans-serif"><br></font></div><div><span style=3D"=
font-family:tahoma,sans-serif">My reason is that I would like to be prepare=
d from the get-go=A0</span><span style=3D"font-family:tahoma,sans-serif">fo=
r large data.</span></div>

<div><span style=3D"font-family:tahoma,sans-serif"><br></span></div><div><f=
ont face=3D"tahoma,sans-serif">My structure is planned as such:</font></div=
><div><ul><li><span style=3D"font-family:tahoma,sans-serif">The data be nod=
es of a huge multidimensional tree.</span></li>

<li><span style=3D"font-family:tahoma,sans-serif">I&#39;m planning on havin=
g each row containing the full node path, e.g. &quot;root.grandparentX.pare=
ntY.babyZ&quot; (or ? &quot;babyZ.parentY.grandparentX.root&quot; )</span><=
/li>

<li><span style=3D"font-family:tahoma,sans-serif">However in terms of data =
per node, it should be pretty much static.</span></li></ul></div><div><font=
 face=3D"tahoma, sans-serif"><br></font></div><div><font face=3D"tahoma,san=
s-serif">While this is a very simple structure, it does seem to be benefici=
al to use HBase / Hadoop just for the=A0</font><font face=3D"tahoma, sans-s=
erif">scalability=A0alone. I also understood that if I get to billions of r=
ows, only an HBase like approach can sustain me?</font></div>

<div><font face=3D"tahoma, sans-serif"><br></font></div><div><font face=3D"=
tahoma, sans-serif">My idea is to start with a simple standalone server and=
 then expand the cluster as the load &amp; data grow.</font></div><div><fon=
t face=3D"tahoma, sans-serif"><br>

</font></div><div><font face=3D"tahoma, sans-serif">If you may,</font></div=
><div><font face=3D"tahoma, sans-serif">I would like your thoughts, mostly =
regarding weather I&#39;m using an Hammer to kill Ants, my proposed=A0data-=
structure=A0or any other advice you may have.</font></div>

<div><font face=3D"tahoma, sans-serif"><br></font></div><div><font face=3D"=
tahoma, sans-serif"><br></font></div><div><font face=3D"tahoma, sans-serif"=
>Kind regards,</font></div><div><font face=3D"tahoma, sans-serif">Jos=E9</f=
ont></div>

<div><font face=3D"tahoma,sans-serif"><br clear=3D"all"></font><div>--<div>=
Jos=E9 Feiteirinha</div><div><br></div><div><a href=3D"http://www.feiteira.=
org/" target=3D"_blank">www.feiteira.org</a></div></div>
</div>
</blockquote></div><br></div>

--047d7b676e6ec42ea504d61a8251--