hbase-user mailing list archives

From Manish Maheshwari <mylogi...@gmail.com>
Subject Re: HBase - Count Rows in Regions and Region Servers
Date Fri, 26 Aug 2016 22:19:43 GMT
Hey, we have approximately 294 regions across 42 region servers.

Manish
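
For reference, the per-region keys in the /jmx sample quoted below follow the pattern Namespace_<namespace>_table_<table>_region_<encoded name>_metric_<metric>. With roughly 294 regions the output gets long, so grouping the keys by region makes it easier to compare regions. The following is a minimal Python sketch, not a definitive tool. It assumes the JSON has already been fetched from a region server's /jmx endpoint and that the metrics sit inside beans in a top-level "beans" list (an assumption about the payload layout that this thread does not confirm). The function name, the bean name, and the sample payload are illustrative. Note that the metrics in the quoted sample are operation counters, not row counts.

```python
import re
from collections import defaultdict

# Key layout taken from the sample in this thread:
#   Namespace_<ns>_table_<table>_region_<32-hex encoded name>_metric_<metric>
# The non-greedy groups assume the namespace does not contain "_table_"
# and the table name does not contain "_region_".
REGION_KEY = re.compile(
    r"^Namespace_(?P<ns>.+?)_table_(?P<table>.+?)"
    r"_region_(?P<region>[0-9a-f]{32})_metric_(?P<metric>.+)$"
)


def group_region_metrics(jmx_payload):
    """Return {(namespace, table, region): {metric: value}}."""
    grouped = defaultdict(dict)
    # Assumption: the payload is a dict with a "beans" list of dicts.
    for bean in jmx_payload.get("beans", []):
        for key, value in bean.items():
            match = REGION_KEY.match(key)
            if match:
                ident = (match["ns"], match["table"], match["region"])
                grouped[ident][match["metric"]] = value
    return dict(grouped)


# Illustrative payload built from keys that appear in the quoted sample.
sample = {
    "beans": [
        {
            "name": "example-bean",  # placeholder, not a real bean name
            "Namespace_default_table_x_region_f9965e20458e7dbf3d4d5b439ae576ad_metric_scanNext_num_ops": 0,
            "Namespace_default_table_x_region_94db4fcd7cabc28c406681f172df2186_metric_get_num_ops": 0,
            "Namespace_default_table_x_region_94db4fcd7cabc28c406681f172df2186_metric_get_mean": 0.0,
        }
    ]
}

by_region = group_region_metrics(sample)
for (ns, table, region), metrics in sorted(by_region.items()):
    print(ns, table, region, metrics)
```

Running the same function over the payload from each of the 42 region servers would give one dictionary per server, which can then be compared for skew.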

On Fri, Aug 26, 2016 at 3:05 PM, Ted Yu <yuzhihong@gmail.com> wrote:

> I currently don't have concrete numbers but the impact is not big.
>
> How many regions are there in the table(s)?
>
> Cheers
>
> On Fri, Aug 26, 2016 at 2:57 PM, Manish Maheshwari <myloginid@gmail.com>
> wrote:
>
> > Thanks Ted. I looked into using JMX. Unfortunately it requires us to
> > restart HBase after the config changes. In the production environment we
> > are unable to do so. The table size is small. Around 9.6 TB. We have around
> > 42 nodes each with 10 TB storage. The scan will take time, but would need a
> > HBase restart.
> >
> > We will enable JMX at the next opportunity for restart. In general, would
> > the impact of JMX on HBase performance be less than 2-3%?
> >
> > Thanks,
> > Manish
> >
> >
> > On Fri, Aug 26, 2016 at 12:11 PM, Ted Yu <yuzhihong@gmail.com> wrote:
> >
> > > Have you looked at the /jmx endpoint on the servers?
> > > Below is a sample of the metrics that would be of interest to you:
> > >
> > >
> > > "Namespace_default_table_x_region_6659ba3fe42b4a196daaba9306b50551_metric_appendCount" : 0,
> > > "Namespace_default_table_x_region_f9965e20458e7dbf3d4d5b439ae576ad_metric_scanNext_num_ops" : 0,
> > > "Namespace_default_table_x_region_f9965e20458e7dbf3d4d5b439ae576ad_metric_scanNext_min" : 0,
> > > "Namespace_default_table_x_region_f9965e20458e7dbf3d4d5b439ae576ad_metric_scanNext_max" : 0,
> > > "Namespace_default_table_x_region_f9965e20458e7dbf3d4d5b439ae576ad_metric_scanNext_mean" : 0.0,
> > > "Namespace_default_table_x_region_f9965e20458e7dbf3d4d5b439ae576ad_metric_scanNext_median" : 0.0,
> > > "Namespace_default_table_x_region_f9965e20458e7dbf3d4d5b439ae576ad_metric_scanNext_75th_percentile" : 0.0,
> > > "Namespace_default_table_x_region_f9965e20458e7dbf3d4d5b439ae576ad_metric_scanNext_95th_percentile" : 0.0,
> > > "Namespace_default_table_x_region_f9965e20458e7dbf3d4d5b439ae576ad_metric_scanNext_99th_percentile" : 0.0,
> > >
> > > "Namespace_default_table_x_region_823a39a250e81f45e5ef493740d936ab_metric_deleteCount" : 0,
> > > "Namespace_default_table_x_region_30b82db17b64a83d4aeda9dbd40d6215_metric_deleteCount" : 0,
> > > "Namespace_default_table_x_region_c6db2e650b3025aa82032b0e0aa8b715_metric_appendCount" : 0,
> > > "Namespace_default_table_x_region_94db4fcd7cabc28c406681f172df2186_metric_get_num_ops" : 0,
> > > "Namespace_default_table_x_region_94db4fcd7cabc28c406681f172df2186_metric_get_min" : 0,
> > > "Namespace_default_table_x_region_94db4fcd7cabc28c406681f172df2186_metric_get_max" : 0,
> > > "Namespace_default_table_x_region_94db4fcd7cabc28c406681f172df2186_metric_get_mean" : 0.0,
> > > "Namespace_default_table_x_region_94db4fcd7cabc28c406681f172df2186_metric_get_median" : 0.0,
> > > "Namespace_default_table_x_region_94db4fcd7cabc28c406681f172df2186_metric_get_75th_percentile" : 0.0,
> > > "Namespace_default_table_x_region_94db4fcd7cabc28c406681f172df2186_metric_get_95th_percentile" : 0.0,
> > > "Namespace_default_table_x_region_94db4fcd7cabc28c406681f172df2186_metric_get_99th_percentile" : 0.0,
> > >
> > > "Namespace_default_table_x_region_5a1fe60f6267c98055b334784e6d76d2_metric_mutateCount" : 0,
> > > "Namespace_default_table_x_region_66bbec5f7e136b226a19b5fdf9f17cbe_metric_incrementCount" : 0,
> > >
> > > On Fri, Aug 26, 2016 at 11:59 AM, Manish Maheshwari <myloginid@gmail.com>
> > > wrote:
> > >
> > > > Hi Ted,
> > > >
> > > > I understand the region crash/migration/splitting impact. Currently we have
> > > > hotspotting on a few region servers. I am trying to collect the row stats at
> > > > the region server and region levels to see how bad the skew of the data is.
> > > >
> > > > Manish
> > > >
> > > > On Fri, Aug 26, 2016 at 10:19 AM, Ted Yu <yuzhihong@gmail.com> wrote:
> > > >
> > > > > Can you elaborate on your use case?
> > > > >
> > > > > Suppose row A is on server B. After you retrieve row A, the region for row
> > > > > A gets moved to server C (load balancer or server crash). Server B would no
> > > > > longer be relevant.
> > > > >
> > > > > Cheers
> > > > >
> > > > > On Fri, Aug 26, 2016 at 10:07 AM, Manish Maheshwari <myloginid@gmail.com>
> > > > > wrote:
> > > > >
> > > > > > Hi,
> > > > > >
> > > > > > I looked at the HBase Count functionality to count rows in a Table. Is
> > > > > > there a way that we can count the number of rows in Regions & Region
> > > > > > Servers? When we use a HBase scan, we don't get the Region ID or Region
> > > > > > Server of the row. Is there a way to do this via Scans?
> > > > > >
> > > > > > Thanks,
> > > > > > Manish
> > > > > >
> > > > >
> > > >
> > >
> >
>
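
One way to approach the original question at the bottom of the thread: a scan result does not carry region information, but each region covers a contiguous key range from its start key (inclusive) to its end key (exclusive), so a row key can be assigned to a region by comparing it against the table's sorted region start keys. The following is a minimal Python sketch of that bookkeeping only. It assumes the region start keys and the region-to-server assignment have already been obtained from the cluster (how to fetch them is outside this sketch), and the keys, server names, and function names are made up for illustration. As noted in the thread, regions can move between servers, so the per-server totals are only a snapshot.

```python
from bisect import bisect_right
from collections import Counter


def count_rows_per_region(row_keys, region_start_keys):
    """Count rows per region, identifying each region by its start key.

    region_start_keys must be sorted and begin with b"" (the first
    region of a table has an empty start key).
    """
    counts = Counter({start: 0 for start in region_start_keys})
    for key in row_keys:
        # Pick the rightmost region whose start key is <= the row key.
        idx = bisect_right(region_start_keys, key) - 1
        counts[region_start_keys[idx]] += 1
    return counts


def count_rows_per_server(region_counts, region_to_server):
    """Sum per-region counts into per-server totals."""
    totals = Counter()
    for start, n in region_counts.items():
        totals[region_to_server[start]] += n
    return totals


# Hypothetical data for illustration only.
starts = [b"", b"g", b"p"]
servers = {b"": "rs1", b"g": "rs2", b"p": "rs1"}
rows = [b"apple", b"grape", b"kiwi", b"pear", b"plum", b"zebra"]

per_region = count_rows_per_region(rows, starts)
per_server = count_rows_per_server(per_region, servers)
print(dict(per_region))   # rows counted under each region start key
print(dict(per_server))   # rows counted under each server
```

Python compares bytes objects lexicographically by unsigned byte value, which matches how HBase orders row keys, so the bisect lookup lands on the same region the row would be stored in.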
