hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Nick Dimiduk <ndimi...@gmail.com>
Subject Re: Re: About HBASE-3149
Date Mon, 23 Dec 2013 22:55:19 GMT
Committed via HBASE-10234.


On Mon, Dec 23, 2013 at 2:35 PM, Stack <stack@duboce.net> wrote:

> I'd do non-relational database as per Jon... just make the change Nick....
> I'll push out later no worries (it is a bit of a pain... learn it by all
> means but this is not cause enough)
>
>
> On Mon, Dec 23, 2013 at 1:03 PM, Jonathan Hsieh <jon@cloudera.com> wrote:
>
> > I usually say, non-relational database.
> >
> > Jon.
> >
> >
> > On Mon, Dec 23, 2013 at 11:52 AM, Nick Dimiduk <ndimiduk@gmail.com>
> wrote:
> >
> > > On Sat, Dec 21, 2013 at 7:07 PM, 乃岩 <sohomodern@126.com> wrote:
> > >
> > > > Hello, Thank you for your reply.
> > > > If use only 1 or 2 CFs, why Hbase say it's a column data store? It's
> > > > actually row-based data store!!
> > > >
> > >
> > > Nit: I don't think HBase claims to be a column-oriented store.
> > > "Column-family," yes... But now that I write this, I see right on the
> > > project homepage: "Apache HBase is an open-source, distributed,
> > versioned,
> > > *column-oriented* store modeled after ..." (emphasis my own).
> > >
> > > From: Andrew Purtell
> > > > Date: 2013-12-22 04:26
> > > > To: dev@hbase.apache.org; sohomodern
> > > > Subject: Re: About HBASE-3149
> > > > Bear in mind that how many files you'll have open simultaneously is a
> > > > function of number of regions, number of column families, and how
> > > > compaction organizes the HBase files on disk (the strategy in effect
> > and
> > > > its parameters, the current ingest rate, and so on). You call
> ballpark
> > > this
> > > > as such: If you have one column family in a table, and store data
> into
> > > all
> > > > the regions, then you will have one file open on the cluster per
> > region,
> > > or
> > > > more. If you have 100,000 column families in a table, and store data
> > into
> > > > all the regions and CFs, then you will have 100,000 files open on the
> > > > cluster per region, *or more*. You will run into OS and HDFS levels
> > > > attempting this, I don't recommend it.
> > > >
> > > >
> > > > I don't think any reasonable schema design needs produce a
> requirement
> > > for
> > > > 100,000 column *families*. You can have any number of keys with
> > > > <column>:<qualifier> in a column family, varying the <qualifier>
to
> > > 100,000
> > > > or 1,000,000 or more unique values is no problem. Can you say more
> > about
> > > > what you are trying to accomplish?
> > > >
> > > >
> > > >
> > > > On Sat, Dec 21, 2013 at 7:17 AM, 乃岩 <sohomodern@126.com> wrote:
> > > >
> > > > Hi,
> > > >    Can anybody tell me if future HBase release will integrate 3149
> for
> > > > Make flush decisions per column family?
> > > >
> > > >   By the way, for current HBase, if the simultaneous flush is the
> only
> > > > issue? I mean, to create 100000 CFs will not be a problem, right?
> > > >
> > > >   Thanks in advance!
> > > >
> > > >
> > > >
> > > >
> > > >
> > > > N.Y.
> > > >
> > > >
> > > >
> > > >
> > > >
> > > > --
> > > >
> > > > Best regards,
> > > >
> > > >    - Andy
> > > >
> > > > Problems worthy of attack prove their worth by hitting back. - Piet
> > Hein
> > > > (via Tom White)
> > > >
> > >
> >
> >
> >
> > --
> > // Jonathan Hsieh (shay)
> > // Software Engineer, Cloudera
> > // jon@cloudera.com
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message