Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3B66DF74A for ; Sat, 5 Oct 2013 00:18:43 +0000 (UTC) Received: (qmail 37086 invoked by uid 500); 5 Oct 2013 00:18:43 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 37053 invoked by uid 500); 5 Oct 2013 00:18:43 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 37044 invoked by uid 99); 5 Oct 2013 00:18:43 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 05 Oct 2013 00:18:43 +0000 Date: Sat, 5 Oct 2013 00:18:42 +0000 (UTC) From: "Ted Yu (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-9272) A simple parallel, unordered scanner MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-9272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13786783#comment-13786783 ] Ted Yu commented on HBASE-9272: ------------------------------- License is missing for ParallelClientScanner.java Please add annotation for audience. {code} + // reader interface + public static interface ResultReader { ... + // writer interface + public static interface ResultWriter { {code} Looks like the above classes can be private. {code} + } catch (InterruptedException ix) { + // ignore {code} Restore interrupt status ? > A simple parallel, unordered scanner > ------------------------------------ > > Key: HBASE-9272 > URL: https://issues.apache.org/jira/browse/HBASE-9272 > Project: HBase > Issue Type: New Feature > Reporter: Lars Hofhansl > Assignee: Lars Hofhansl > Priority: Minor > Attachments: 9272-0.94.txt, 9272-0.94-v2.txt, 9272-0.94-v3.txt, 9272-0.94-v4.txt, 9272-trunk.txt, ParallelClientScanner.java, ParallelClientScanner.java > > > The contract of ClientScanner is to return rows in sort order. That limits the order in which region can be scanned. > I propose a simple ParallelScanner that does not have this requirement and queries regions in parallel, return whatever gets returned first. > This is generally useful for scans that filter a lot of data on the server, or in cases where the client can very quickly react to the returned data. > I have a simple prototype (doesn't do error handling right, and might be a bit heavy on the synchronization side - it used a BlockingQueue to hand data between the client using the scanner and the threads doing the scanning, it also could potentially starve some scanners long enugh to time out at the server). > On the plus side, it's only a 130 lines of code. :) -- This message was sent by Atlassian JIRA (v6.1#6144)