hbase-user mailing list archives

From ramkrishna vasudevan <ramkrishna.s.vasude...@gmail.com>
Subject Re: Column family names and data size on disk
Date Wed, 28 Nov 2012 15:36:46 GMT
Nice link. I did not know about this.

Regards
Ram

On Wed, Nov 28, 2012 at 9:03 PM, Mohammad Tariq <dontariq@gmail.com> wrote:

> Along with what Ram sir has said, you may also find this link useful:
> http://blog.cloudera.com/blog/2012/06/hbase-io-hfile-input-output/
>
> Regards,
>     Mohammad Tariq
>
>
>
> On Wed, Nov 28, 2012 at 8:44 PM, ramkrishna vasudevan <
> ramkrishna.s.vasudevan@gmail.com> wrote:
>
> > Also think of the KeyValue that is in the memstore.
> > The entire KeyValue is used for byte ordering.
> >
> > So in memory, if I have CF1 and CF2, CF1 should be sorted first by that
> > ordering. This is also a reason why the column family is part of the
> > KeyValue, which goes into the HFile in the same format when flushed.
> >
> > Regards
> > Ram
> >
> > On Wed, Nov 28, 2012 at 8:28 PM, ramkrishna vasudevan <
> > ramkrishna.s.vasudevan@gmail.com> wrote:
> >
> > > I can find only the JIRA id pertaining to it
> > >
> > > > https://issues.apache.org/jira/browse/HBASE-4218
> > >
> > > > Basically, what I understand from the design is that HBase is a
> > > > multi-level key-value map:
> > > >
> > > > {rowkey -> CF -> Cols -> TimeStamp} -> value
> > > >
> > > > Map<RowKey, Map<CF, Map<ColQual, Map<TimeStamp, Value>>>>
> > > >
> > > > So every cell has all the info (rowkey, CF, qualifier, timestamp and value).
> > > > And ideally the encoding algorithm is there to avoid such repetitions.
> > > > Does anyone want to add to this? Please feel free to do so...
> > >
> > > Regards
> > > Ram
> > >
> > >
> > > On Wed, Nov 28, 2012 at 8:10 PM, matan <matan@cloudaloe.org> wrote:
> > >
> > >> Thanks Ram,
> > >>
> > > >> Why does the CF have to be in the HFile? Isn't the entire HFile
> > > >> dedicated to just one CF to start with? (I'm speaking at the HBase
> > > >> architecture level, trying to figure out why it works the way it does.)
> > > >>
> > > >> That was the main interest in my question, but could you indicate a tad
> > > >> more what those encoding algorithms are useful for, or suggest a link
> > > >> for reading about them?
> > >>
> > >> Thanks,
> > >> Matan
> > >>
> > >>
> > >>
> > >> --
> > >> View this message in context:
> > >>
> >
> http://apache-hbase.679495.n3.nabble.com/Column-family-names-and-data-size-on-disk-tp4034507p4034509.html
> > >> Sent from the HBase User mailing list archive at Nabble.com.
> > >>
> > >
> > >
> >
>
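
The byte ordering Ram mentions can be sketched roughly as below. This is a toy illustration, not HBase code: the tuple layout and the name sort_cells are assumptions made for the example. It orders cells by row, then column family, then qualifier, with newer timestamps first, which is the general shape of the KeyValue ordering as I understand it.

```python
from typing import List, Tuple

# Hypothetical cell key for illustration: (row, family, qualifier, timestamp)
CellKey = Tuple[bytes, bytes, bytes, int]


def sort_cells(cells: List[CellKey]) -> List[CellKey]:
    """Sort by row, family, qualifier ascending, then timestamp descending.

    Python compares bytes lexicographically by unsigned byte value, which is
    the kind of ordering the thread refers to as byte ordering.
    """
    return sorted(cells, key=lambda c: (c[0], c[1], c[2], -c[3]))


cells = [
    (b"r2", b"cf1", b"a", 1),
    (b"r1", b"cf2", b"a", 1),
    (b"r1", b"cf1", b"a", 1),
    (b"r1", b"cf1", b"a", 5),
]

for cell in sort_cells(cells):
    print(cell)
```

Because the column family is one of the components of the key, it takes part in the comparison like any other component, which is the point being made in the thread.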

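The multi-level map and the repetition it causes can be sketched as follows. Everything here is hypothetical and only for illustration: the "/"-joined key layout, the function names, and the simple prefix scheme are assumptions, not the on-disk HFile format or the encoders added under HBASE-4218. The sketch flattens the nested map into cells that each carry the full key, then compares the raw key size with the size left after dropping the prefix each key shares with the previous one.

```python
from typing import Dict, List

# Toy nested map: row -> column family -> qualifier -> timestamp -> value
Table = Dict[str, Dict[str, Dict[str, Dict[int, bytes]]]]


def flatten_keys(table: Table) -> List[bytes]:
    """Return one full key per cell, in sorted order (toy "/"-joined layout)."""
    keys = []
    for row in sorted(table):
        for cf in sorted(table[row]):
            for qual in sorted(table[row][cf]):
                for ts in sorted(table[row][cf][qual], reverse=True):
                    keys.append(f"{row}/{cf}/{qual}/{ts}".encode())
    return keys


def common_prefix_len(a: bytes, b: bytes) -> int:
    """Number of leading bytes that a and b share."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def raw_size(keys: List[bytes]) -> int:
    """Total bytes if every cell stores its whole key."""
    return sum(len(k) for k in keys)


def prefix_encoded_size(keys: List[bytes]) -> int:
    """Total bytes if each key stores only the part that differs from the
    previous key (length fields are ignored in this toy count)."""
    total = 0
    prev = b""
    for k in keys:
        total += len(k) - common_prefix_len(prev, k)
        prev = k
    return total


table: Table = {"r1": {"cf1": {"a": {1: b"x"}, "b": {1: b"y"}}}}
keys = flatten_keys(table)
print(raw_size(keys), prefix_encoded_size(keys))
```

In this toy case the two keys share the row and column family, so most of the second key is repeated bytes; that repetition is what the encoding discussed in the thread is meant to reduce.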