hbase-user mailing list archives

From Rajeshkumar J <rajeshkumarit8...@gmail.com>
Subject Re: How to increase scan speed in Hbase
Date Sat, 28 Nov 2015 05:51:08 GMT
Hi Ted,

  I want to pull all the rows that match on the column id, irrespective of
row key and time range. So STARTROW, STOPROW or TIMERANGE won't work for my
use case.

Thanks

On Fri, Nov 27, 2015 at 10:04 PM, Ted Yu <yuzhihong@gmail.com> wrote:

> HBase itself doesn't provide secondary indexing support, yet.
>
> How many regions does your table have ?
>
> In my first response, I mentioned using STARTROW, STOPROW or TIMERANGE to
> narrow the scope of search.
> Have you considered that ?
>
> Cheers
>
> On Thu, Nov 26, 2015 at 9:07 PM, Rajeshkumar J <rajeshkumarit8292@gmail.com> wrote:
>
> > Hi Ted,
> >
> >      I have tried major compaction, but it had no impact on the
> > performance. After searching, I found that secondary indexing
> > will boost the scanning performance, but I didn't find any useful
> > resource. Can you suggest a site or any other reference?
> >
> > On Thu, Nov 26, 2015 at 8:32 PM, Ted Yu <yuzhihong@gmail.com> wrote:
> >
> > > Excerpt from hbase-shell//src/main/ruby/shell/commands/major_compact.rb :
> > >
> > >           Examples:
> > >           Compact all regions in a table:
> > >           hbase> major_compact 't1'
> > >
> > > Cheers
> > >
> > > On Wed, Nov 25, 2015 at 10:00 PM, Rajeshkumar J <rajeshkumarit8292@gmail.com> wrote:
> > >
> > > > Hi Ted Yu,
> > > >
> > > >     No, I have not major compacted the table. Can you tell me how to
> > > > do this, or point me to a link?
> > > >
> > > > Thanks,
> > > > RajeshKumar J
> > > >
> > > > On Wed, Nov 25, 2015 at 9:26 PM, Ted Yu <yuzhihong@gmail.com> wrote:
> > > >
> > > > > After loading the data, have you major compacted the table ?
> > > > >
> > > > > You can include STARTROW, STOPROW and TIMERANGE for your scan to
> > > > > narrow the scope.
> > > > >
> > > > > FYI
> > > > >
> > > > > On Wed, Nov 25, 2015 at 2:36 AM, Rajeshkumar J <rajeshkumarit8292@gmail.com> wrote:
> > > > >
> > > > > > Hi,
> > > > > >
> > > > > >
> > > > > > I am new to Apache HBase and I am using hbase-0.98.13. I have
> > > > > > created a table sample with column family sample_family, and I have
> > > > > > loaded the output from a Pig script into the HBase table. When I try
> > > > > > to scan the table based on one of the columns in the column family,
> > > > > > it takes more than 2 minutes.
> > > > > >
> > > > > > Here is the query
> > > > > >
> > > > > > scan 'sample', {FILTER=>"SingleColumnValueFilter('sample_family','id',=,'binary:1000')"}
> > > > > >
> > > > > > Can anyone tell me how to bring this down to one or two seconds?
> > > > > >
> > > > > > I have searched, and most sites suggest changing the default
> > > > > > value of the hbase.client.scanner.caching property. I have changed
> > > > > > it too, but there is no change in performance.
> > > > > > Are there any configuration changes to be made for this? Can anyone
> > > > > > help me with this?
> > > > > >
> > > > > > Regards,
> > > > > > Rajeshkumar J
> > > > > >
> > > > >
> > > >
> > >
> >
>
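
The thread turns on two points: a SingleColumnValueFilter scan still has to read every row, and HBase (as Ted notes) has no built-in secondary index. One common workaround is a second, manually maintained table whose row key starts with the column value, so that a lookup by value becomes a short range scan. The following is only a minimal in-memory sketch of that idea, not HBase client code: a TreeMap stands in for a table sorted by row key, and the class name, method names, and the "\0" key separator are all hypothetical choices made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.NavigableMap;
import java.util.TreeMap;

/**
 * In-memory stand-in for the index-table pattern. This is NOT the HBase API:
 * each TreeMap plays the role of a table sorted by row key.
 */
public class SecondaryIndexSketch {
    // Assumed separator between the indexed value and the original row key.
    private static final char SEP = '\u0000';

    // "Data table": row key -> value of sample_family:id
    private final NavigableMap<String, String> dataTable = new TreeMap<>();
    // "Index table": "<id value><SEP><row key>" -> row key
    private final NavigableMap<String, String> indexTable = new TreeMap<>();

    /** Writes a row and keeps the index entry in step with it. */
    public void put(String rowKey, String id) {
        String previous = dataTable.put(rowKey, id);
        if (previous != null) {
            // Drop the stale index entry when a row's id changes.
            indexTable.remove(previous + SEP + rowKey);
        }
        indexTable.put(id + SEP + rowKey, rowKey);
    }

    /** Analogue of the filter scan in the thread: every row is visited. */
    public List<String> fullScan(String id) {
        List<String> hits = new ArrayList<>();
        for (Map.Entry<String, String> row : dataTable.entrySet()) {
            if (row.getValue().equals(id)) {
                hits.add(row.getKey());
            }
        }
        return hits;
    }

    /** Analogue of a start/stop-row scan on the index table: only matching keys are visited. */
    public List<String> indexLookup(String id) {
        String startRow = id + SEP;               // inclusive
        String stopRow = id + (char) (SEP + 1);   // exclusive
        return new ArrayList<>(indexTable.subMap(startRow, true, stopRow, false).values());
    }

    public static void main(String[] args) {
        SecondaryIndexSketch table = new SecondaryIndexSketch();
        table.put("row1", "1000");
        table.put("row2", "2000");
        table.put("row3", "1000");
        table.put("row4", "10000");
        System.out.println(table.fullScan("1000"));
        System.out.println(table.indexLookup("1000"));
    }
}
```

Both methods return the same row keys; the difference is how many entries they touch. In a real deployment the application would have to write the index row alongside every data row and handle the case where one of the two writes fails, which this sketch does not attempt.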
