hbase-user mailing list archives

From Mohammad Tariq <donta...@gmail.com>
Subject Re: Column family names and data size on disk
Date Wed, 28 Nov 2012 15:33:48 GMT
Along with whatever Ram sir has said, you may also find this link useful:
http://blog.cloudera.com/blog/2012/06/hbase-io-hfile-input-output/

Regards,
    Mohammad Tariq



On Wed, Nov 28, 2012 at 8:44 PM, ramkrishna vasudevan <
ramkrishna.s.vasudevan@gmail.com> wrote:

> Also think of the KeyValue that is in the memstore.
> The entire KeyValue is used for byte ordering.
>
> So in memory, if I have CF1 and CF2, CF1 should sort first under that
> ordering. This is also a reason why the column family is part of the
> KeyValue, which goes into the HFile in the same format when flushed.
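
The ordering described above can be sketched in a few lines of Python. This is only an illustration, not HBase code: `Cell` and `sort_key` are hypothetical names, the key is simplified (no length fields, no key-type byte), and the newest-timestamp-first rule is included because that is how HBase orders versions of the same column.

```python
from typing import List, NamedTuple


class Cell(NamedTuple):
    """Hypothetical stand-in for a KeyValue: every cell carries its full key."""
    row: bytes
    family: bytes
    qualifier: bytes
    timestamp: int
    value: bytes


def sort_key(cell: Cell):
    # Row, family and qualifier compare as raw bytes, ascending.
    # The timestamp is negated so that newer versions sort first.
    return (cell.row, cell.family, cell.qualifier, -cell.timestamp)


unsorted_cells: List[Cell] = [
    Cell(b"row1", b"CF2", b"q", 5, b"c"),
    Cell(b"row2", b"CF1", b"q", 5, b"d"),
    Cell(b"row1", b"CF1", b"q", 5, b"a"),
    Cell(b"row1", b"CF1", b"q", 9, b"b"),
]

# Within row1, the CF1 cells come before the CF2 cell because the family
# bytes take part in the comparison.
ordered = sorted(unsorted_cells, key=sort_key)
```

Because the family participates in the comparison, it has to be present in every key for a plain byte-wise sort to work.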
>
> Regards
> Ram
>
> On Wed, Nov 28, 2012 at 8:28 PM, ramkrishna vasudevan <
> ramkrishna.s.vasudevan@gmail.com> wrote:
>
> > I can find only the JIRA id pertaining to it
> >
> > https://issues.apache.org/jira/browse/HBASE-4218
> >
> > Basically, what I understand from the design is that HBase is a
> > multi-level key-value map:
> >
> > {rowkey->CF->Cols->TimeStamp}->value
> >
> > Map<Rowkey, Map<CF, Map<colQual, Map<TimeStamp, Value>>>>
> >
> > So every cell carries all the info (rowkey, CF, qualifier, timestamp and
> > value). The encoding algorithms are basically there to avoid such
> > repetition. Does anyone want to add to this? Please feel free to do so...
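
A rough Python sketch of the point above: flattening the nested map into self-contained cells repeats the row and column family in every key, and storing only the part of each key that differs from the previous one removes much of that repetition. This is an assumption-laden illustration, not the encoding from HBASE-4218: the function names are made up, the key layout is simplified (no length fields or type byte), and the bookkeeping needed to record each shared-prefix length is ignored.

```python
from typing import Dict, List, Tuple

# Logical view: row -> column family -> qualifier -> timestamp -> value
Table = Dict[bytes, Dict[bytes, Dict[bytes, Dict[int, bytes]]]]
Cell = Tuple[bytes, bytes, bytes, int, bytes]


def flatten(table: Table) -> List[Cell]:
    """Expand the nested map into one self-contained cell per value."""
    cells = []
    for row, families in table.items():
        for family, qualifiers in families.items():
            for qualifier, versions in qualifiers.items():
                for ts, value in versions.items():
                    cells.append((row, family, qualifier, ts, value))
    return sorted(cells)


def key_bytes(cell: Cell) -> bytes:
    """Concatenate the key parts of a cell (simplified layout)."""
    row, family, qualifier, ts, _ = cell
    return row + family + qualifier + ts.to_bytes(8, "big")


def common_prefix_len(a: bytes, b: bytes) -> int:
    """Number of leading bytes that a and b share."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def raw_key_size(cells: List[Cell]) -> int:
    """Total key bytes when every cell stores its full key."""
    return sum(len(key_bytes(c)) for c in cells)


def prefix_encoded_key_size(cells: List[Cell]) -> int:
    """Total key bytes when each key keeps only the suffix that differs
    from the previous key (prefix-length bookkeeping not counted)."""
    total, prev = 0, b""
    for cell in cells:
        key = key_bytes(cell)
        total += len(key) - common_prefix_len(prev, key)
        prev = key
    return total


table: Table = {
    b"row1": {b"metrics_family": {b"a": {1: b"x"}, b"b": {1: b"y"}}},
    b"row2": {b"metrics_family": {b"a": {1: b"z"}}},
}
cells = flatten(table)
print(raw_key_size(cells), prefix_encoded_key_size(cells))
```

In this toy example the saving is largest between cells of the same row, where the row and family bytes are shared; a long family name costs more in the raw layout than in the prefix-trimmed one.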
> >
> > Regards
> > Ram
> >
> >
> > On Wed, Nov 28, 2012 at 8:10 PM, matan <matan@cloudaloe.org> wrote:
> >
> >> Thanks Ram,
> >>
> >> Why does the CF have to be in the HFile? Isn't the entire HFile
> >> dedicated to just one CF to start with? (I'm speaking at the HBase
> >> architecture level, trying to figure out why it works the way it does.)
> >>
> >> That was the main interest in my question, but could you say a bit more
> >> about what those encoding algorithms are useful for, or suggest a link
> >> for reading about them?
> >>
> >> Thanks,
> >> Matan
> >>
> >>
> >>
> >
> >
>
