From: Harsh J
Date: Mon, 25 Mar 2013 21:01:24 +0530
Subject: Re: DataNode heartbeat average time peaks
Reply-To: user@hadoop.apache.org

What's your fsimage size? If it is too large, you will want to throttle the
checkpoint transfer bandwidth so that the transfer does not add load on the NN.
This is available via the JIRA HDFS-1457.

On Mon, Mar 25, 2013 at 8:13 PM, Ivan Tretyakov wrote:
> Hi!
>
> We see DataNode heartbeat average time peaks in Ganglia of up to 20-70 seconds
> while the SecondaryNameNode performs checkpointing.
> Please see the attached screenshots.
>
> I would like to clarify whether this is OK, and what consequences
> and risks it could bring.
>
> --
> Best Regards
> Ivan Tretyakov

--
Harsh J
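
A note on the throttle suggested in the reply: limiting checkpoint transfer bandwidth trades NameNode load for a longer transfer, so it helps to estimate how long one image transfer would take at a given limit. The sketch below is only that arithmetic. The 4 GiB fsimage size and the 10 MiB/s limit are made-up illustration values, and `transfer_seconds` is a hypothetical helper, not part of Hadoop. In releases that include HDFS-1457 the limit is, as far as I know, set through `dfs.image.transfer.bandwidthPerSec` in hdfs-site.xml (bytes per second, 0 meaning unthrottled); treat that name as an assumption and check the hdfs-default.xml shipped with your version.

```python
GiB = 1024 ** 3
MiB = 1024 ** 2


def transfer_seconds(fsimage_bytes: int, bandwidth_bytes_per_sec: int) -> float:
    """Lower bound on the time to move one fsimage at a fixed throttle.

    Ignores protocol overhead and the separate edits transfer, so real
    checkpoints will take somewhat longer than this estimate.
    """
    if bandwidth_bytes_per_sec <= 0:
        # A non-positive value is assumed here to mean "no throttle",
        # in which case this estimate does not apply.
        raise ValueError("throttle must be a positive number of bytes per second")
    return fsimage_bytes / bandwidth_bytes_per_sec


# Hypothetical numbers, not taken from the thread.
fsimage_size = 4 * GiB
throttle = 10 * MiB

seconds = transfer_seconds(fsimage_size, throttle)
print(f"one image transfer takes at least {seconds:.1f} s at {throttle // MiB} MiB/s")
```

If the estimate approaches the checkpoint period, the limit is probably too tight for that fsimage size, and a higher value would be worth trying.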