Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 82D8210359 for ; Wed, 15 Jan 2014 07:08:42 +0000 (UTC) Received: (qmail 28897 invoked by uid 500); 15 Jan 2014 07:08:39 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 28209 invoked by uid 500); 15 Jan 2014 07:08:37 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 28192 invoked by uid 99); 15 Jan 2014 07:08:35 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 15 Jan 2014 07:08:35 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ramon@appannie.com designates 209.85.215.42 as permitted sender) Received: from [209.85.215.42] (HELO mail-la0-f42.google.com) (209.85.215.42) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 15 Jan 2014 07:08:30 +0000 Received: by mail-la0-f42.google.com with SMTP id n7so1100839lam.15 for ; Tue, 14 Jan 2014 23:08:09 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to:content-type; bh=4tKFJTJC5U8dpf1dFJ1mxGwMnzA7qxs5+LQs/pftMfs=; b=VnWlGeYH6ORKtKes2I0iI0/E2H0e3Rai5WeNMahDrjm8ToDr5HZ9U/YuN3szO+2nIp XH95ob51nvUK5fE58HfRO41XNtunVWyC+ZRl0qmrHcJAQZv5UKA6qE8TfS2p70zhnLWF 0s2aomtvEh4M8id3Jo2C8E8Xajb3LDVTEfiCh45DExlhnLZZlSeuk2ibUP/yiGISQdqz Vy9AfFAMPHKlSTy7uheHI2bLM4+CWNkt2vgvcffmX8t9YQlniZLoBgHm2j15ichmYzA1 eSQR1UlID6xRfyorJ0iHBO2yBE5AZ0oKkyhk5kQliAd3NdPTHX9lJx5M5tf47epWegs8 G2YA== X-Gm-Message-State: ALoCoQnJ5Xuz8USA8zN2jfzkEzAYKqwWYm8dOUAgKOysHEsO0ey+ii/u8eZNZNS/5ANxZg9arV2A MIME-Version: 1.0 X-Received: by 10.112.92.112 with SMTP id cl16mr368200lbb.15.1389769688866; Tue, 14 Jan 2014 23:08:08 -0800 (PST) Received: by 10.115.4.38 with HTTP; Tue, 14 Jan 2014 23:08:08 -0800 (PST) In-Reply-To: <5A5AB29C-A0B0-4497-AB5F-83FD04CF1E44@gmail.com> References: <5A5AB29C-A0B0-4497-AB5F-83FD04CF1E44@gmail.com> Date: Wed, 15 Jan 2014 15:08:08 +0800 Message-ID: Subject: Re: Fast scan with PrefixFilter? From: Ramon Wang To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=001a11336c569ff1c904effcf86b X-Virus-Checked: Checked by ClamAV on apache.org --001a11336c569ff1c904effcf86b Content-Type: text/plain; charset=ISO-8859-1 Hi Ted Thanks for the quick reply. With this FuzzyRowFilter, do i still need to pass in startRow and stopRow like below when constructing a Scan object? > Scan(byte [] startRow, byte [] stopRow) Will the FuzzyRowFilter provide us performance like a directly get by row when we pass something like "20140101_EN_?" Cheers Ramon On Wed, Jan 15, 2014 at 2:22 PM, Ted Yu wrote: > Please take a look at > http://hbase.apache.org/0.94/apidocs/org/apache/hadoop/hbase/filter/FuzzyRowFilter.html > > Cheers > > On Jan 14, 2014, at 10:16 PM, Ramon Wang wrote: > > > Hi Folks > > > > We have a table with fixed pattern row key design, the format for the row > > key is YEAR_COUNTRY_randomNumber, for example: > > > > 20140101_EN_1 > > 20140101_EN_2 > > 20140101_EN_3 > > 20140101_US_1 > > 20140101_US_2 > > 20140101_US_3 > > ... > > > > Is there a way i can quickly get the data for "20140101_EN_*" by using > Scan > > without scan the full table? I think we are probably going to use > > the PrefixFilter filter with the Scan object, but the problem is that we > > don't know the "startRow" for each scan, any ideas? > > > > Thanks > > Ramon > --001a11336c569ff1c904effcf86b--