hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sudarshan Kadambi (BLOOMBERG/ 731 LEXIN)" <skada...@bloomberg.net>
Subject Re: Easiest way to get a random sample of keys
Date Fri, 24 Jan 2014 23:35:03 GMT
Fair enough, but I was looking for a larger sample. Say, 5% of all data in my table that has
a few million rows. 

----- Original Message -----
From: user@hbase.apache.org
To: user@hbase.apache.org
At: Jan 24 2014 18:29:23

How many regions do you have? Can you take the first key of each region as
a sample?
Le 2014-01-24 18:15, "Sudarshan Kadambi (BLOOMBERG/ 731 LEXIN)" <
skadambi@bloomberg.net> a écrit :

> Something like count 't1', {INTERVAL=>20} should give me every 20th row in
> table 't1'. Is there an easy way to get a random sample via. the shell
> using filters?
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message