hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sam Seigal <selek...@yahoo.com>
Subject Re: querying questions
Date Fri, 28 Oct 2011 22:53:44 GMT
Open TDSB does it I believe :

http://opentsdb.net/schema.html

I am curious to know although the difference between having string as a row
key, converting them into bytes, and then storing the key , as opposed to
having numerical values stored as native bytes as bit masks.



On Fri, Oct 28, 2011 at 3:27 PM, Rita <rmorgan466@gmail.com> wrote:

> Thanks for your reply.
>
> I am curious, can you give me an example of having a key as a raw bytes and
> do byte comparison? I am not a native java programmer so an example would
> be extremely helpful in my case.
>
>
>
> On Fri, Oct 28, 2011 at 12:23 PM, Stack <stack@duboce.net> wrote:
>
> > On Fri, Oct 28, 2011 at 4:30 AM, Rita <rmorgan466@gmail.com> wrote:
> > > Couple of questions:
> > > What is the best delimiter for a key? Does it even matter? I read
> > somewhere
> > > that using a \t is optimal for a reason.
> > >
> >
> > Do without a delimiter if you can.  Just make the row key elements of
> > fixed size.
> >
> > It looks like though that your your key schema would require you have
> > a delimiter (I'm guessing 'server' can be anything -- or can it be
> > contained so all servers have same size'd name?)
> >
> > If you have to have a delimiter, choose one that is illegal in a
> > server name or user name so you can be sure it doesn't show up in
> > either ever and throw off your parse.
> >
> > > For these types of queries I have been using filters particularly,
> > > RegexStringComparator
> > > (w/start&stop) and things seem to work to an extent. I was wondering is
> > this
> > > the correct way to query or is there a more optimal way?
> > >
> >
> > Regex'ing over keys will be expensive.  HBase is all bytes.  To regex,
> > you need to change the bytes into a String.  Java Strings are i18n and
> > multi-byte natively so it costs making them.  Can you make your key as
> > raw bytes and do byte compares in your filtering?
> >
> > > I also couldnt find any examples using filters for timeseries data, is
> > there
> > > a place I should be looking at?
> > >
> > >
> >
> > I thought tsdb used filters?
> >
> > St.Ack
> >
>
>
>
> --
> --- Get your facts first, then you can distort them as you please.--
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message