Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 89FBC200B59 for ; Mon, 8 Aug 2016 15:58:22 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 88AC9160AB4; Mon, 8 Aug 2016 13:58:22 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id CFE23160A77 for ; Mon, 8 Aug 2016 15:58:21 +0200 (CEST) Received: (qmail 10364 invoked by uid 500); 8 Aug 2016 13:58:21 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 10342 invoked by uid 99); 8 Aug 2016 13:58:20 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 08 Aug 2016 13:58:20 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id B43972C0D5F for ; Mon, 8 Aug 2016 13:58:20 +0000 (UTC) Date: Mon, 8 Aug 2016 13:58:20 +0000 (UTC) From: "Anatoli Shein (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (HDFS-10679) libhdfs++: Implement parallel find with wildcards tool MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Mon, 08 Aug 2016 13:58:22 -0000 [ https://issues.apache.org/jira/browse/HDFS-10679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anatoli Shein updated HDFS-10679: --------------------------------- Attachment: HDFS-10679.HDFS-8707.002.patch Quick fix in async code > libhdfs++: Implement parallel find with wildcards tool > ------------------------------------------------------ > > Key: HDFS-10679 > URL: https://issues.apache.org/jira/browse/HDFS-10679 > Project: Hadoop HDFS > Issue Type: Sub-task > Components: hdfs-client > Reporter: Anatoli Shein > Assignee: Anatoli Shein > Attachments: HDFS-10679.HDFS-8707.000.patch, HDFS-10679.HDFS-8707.001.patch, HDFS-10679.HDFS-8707.002.patch > > > The find tool will issue the GetListing namenode operation on a given directory, and filter the results using posix globbing library. > If the recursive option is selected, for each returned entry that is a directory the tool will issue another asynchronous call GetListing and repeat the result processing in a recursive fashion. > One implementation issue that needs to be addressed is the way how results are returned back to the user: we can either buffer the results and return them to the user in bulk, or we can return results continuously as they arrive. While buffering would be an easier solution, returning results as they arrive would be more beneficial to the user in terms of performance, since the result processing can start as soon as the first results arrive without any delay. In order to do that we need the user to use a loop to process arriving results, and we need to send a special message back to the user when the search is over. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org