Return-Path: X-Original-To: apmail-hadoop-hdfs-dev-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A174DDEC4 for ; Fri, 30 Nov 2012 11:50:48 +0000 (UTC) Received: (qmail 76050 invoked by uid 500); 30 Nov 2012 11:50:47 -0000 Delivered-To: apmail-hadoop-hdfs-dev-archive@hadoop.apache.org Received: (qmail 75749 invoked by uid 500); 30 Nov 2012 11:50:47 -0000 Mailing-List: contact hdfs-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-dev@hadoop.apache.org Delivered-To: mailing list hdfs-dev@hadoop.apache.org Received: (qmail 75733 invoked by uid 99); 30 Nov 2012 11:50:47 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 30 Nov 2012 11:50:47 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of harsh@cloudera.com designates 209.85.223.176 as permitted sender) Received: from [209.85.223.176] (HELO mail-ie0-f176.google.com) (209.85.223.176) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 30 Nov 2012 11:50:41 +0000 Received: by mail-ie0-f176.google.com with SMTP id 13so475284iea.35 for ; Fri, 30 Nov 2012 03:50:20 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:x-gm-message-state; bh=lEUCDO0wOI6CdWDK+ZvLHYta/lrv6wFeoOpxpOUUQSg=; b=hexRZr2c+FgRSM8YvswyLGP3p/d4m0koN4+GdXZSg+SDiRa7tNxbn6DERpZKPHmD34 qhcnCz2vDe/ZtbbVuUFX1KINpDabgEUP88Qx9KrzKXkDYyZc/vCb26OD2yDAYz0t06SM 93zexKlmLLWVl5Cy97rovMMm/2YV8TTADGFJI69wyN38F5RxudNOpwgyjkUbI5krLDbl MTDgAzpxfU2bnpmfKggE0SEI+vcqGORvxaYCVQcZcdBWmPF2H+g3dJIYLXNZ/2Gr2XvR xbZ3wjDe7GfEOHDT9Pav078Hbzp+4jPA/bPBcf6FB6pVCuvbz+/BlJzcFtMjWXmFI+hU 3Tyg== Received: by 10.43.46.2 with SMTP id um2mr688137icb.18.1354276220254; Fri, 30 Nov 2012 03:50:20 -0800 (PST) MIME-Version: 1.0 Received: by 10.64.6.129 with HTTP; Fri, 30 Nov 2012 03:50:00 -0800 (PST) In-Reply-To: References: From: Harsh J Date: Fri, 30 Nov 2012 17:20:00 +0530 Message-ID: Subject: Re: Clarifications on excludedNodeList in DFSClient To: hdfs dev Content-Type: text/plain; charset=ISO-8859-1 X-Gm-Message-State: ALoCoQmxfbL95erGyU3DPVBLppGWP5lkFawnKhWTscE7St3QvIy7gWznH4+WZSq8TB6pSVKI0fmc X-Virus-Checked: Checked by ClamAV on apache.org Hi Inder, I didn't see a relevant JIRA on this yet, so I went ahead and filed one at https://issues.apache.org/jira/browse/HDFS-4246 as it seems to affect HBase WALs too (when their blocksizes are configured large, they form a scenario similar to yours), on small clusters or in specific scenarios over large ones. I think a timed-life cache of excludeNodesList would be more preferable than a static, but we can keep it optional (or defaulted to infinite) to not harm the existing behavior. On Tue, Nov 20, 2012 at 11:49 PM, Harsh J wrote: > The excludeNode list is initialized for each output stream created > under a DFSClient instance. That is, it is empty for every new > FS.create() returned DFSOutputStream initially and is maintained > separately for each file created under a common DFSClient. > > However, this could indeed be a problem for a long-running single-file > client, which I assume is a continuously alive and hflush()-ing one. > > Can you search for and file a JIRA to address this with any discussion > taken there? Please put up your thoughts there as well. > > On Mon, Nov 19, 2012 at 3:25 PM, Inder Pall wrote: >> Folks, >> >> i was wondering if there is any mechanism/logic to move a node back from the >> excludedNodeList to live nodes to be tried for new block creation. >> >> In the current DFSClient code i do not see this. The use-case is if the >> write timeout is being reduced and certain nodes get aggressively added to >> the excludedNodeList and the client caches DFSClient then the excludedNodes >> never get tried again in the lifetime of the application caching DFSClient >> >> >> -- >> - Inder >> "You are average of the 5 people you spend the most time with" >> > > > > -- > Harsh J -- Harsh J