hi, Ted.
Do you know whether there is optimization on scan with TimeRange?
Actually, if set a sparse TimeRange and large scan cache, it will cause rpc
time out sometimes.
Actually, want to know whether it requires scanning each KV for checking
its timestamp?
Thanks,
-Bing
2014-06-28 21:25 GMT+08:00 Ted Yu <yuzhihong@gmail.com>:
> Have you looked at the following method in AggregationClient ?
>
> long rowCount(final HTable table,
>
> final ColumnInterpreter<R, S, P, Q, T> ci, final Scan scan) throws
> Throwable {
>
> You can specify timerange through scan parameter.
>
> See this method of Scan:
>
> public Scan setTimeRange(long minStamp, long maxStamp)
>
> Cheers
>
>
> On Sat, Jun 28, 2014 at 3:42 AM, yogi <yogidave15@gmail.com> wrote:
>
> > Hi,
> >
> > I have a requirement where I have to make a shell script using which i
> need
> > to scan some 6 huge hbase tables and get the count of records present in
> > them. Also i need the counts per day wise where i pass the date parameter
> > to
> > the shell script which calls these scan commands. I did find a way to
> > convert the date to epoch time and pass it to scan command but the scan
> > keeps running forever. Can some one help me in making this faster.
> >
> > Note: I am scanning the tables based on TIMERANGE as all the tables have
> > this field.
> >
> > Thanks,
> > Yogi
> >
> >
> >
> > --
> > View this message in context:
> >
> http://apache-hbase.679495.n3.nabble.com/Hbase-scan-using-TIMERANGE-tp4060851.html
> > Sent from the HBase User mailing list archive at Nabble.com.
> >
>
|