Date: Sun, 2 Aug 2015 11:58:04 +0000 (UTC)
From: "Surendra Singh Lilhore (JIRA)"
To: hdfs-issues@hadoop.apache.org
Reply-To: hdfs-issues@hadoop.apache.org
Subject: [jira] [Updated] (HDFS-247) A tool to plot the locations of the blocks of a directory

     [ https://issues.apache.org/jira/browse/HDFS-247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Surendra Singh Lilhore updated HDFS-247:
----------------------------------------
    Assignee:     (was: Surendra Singh Lilhore)

> A tool to plot the locations of the blocks of a directory
> ---------------------------------------------------------
>
>                 Key: HDFS-247
>                 URL: https://issues.apache.org/jira/browse/HDFS-247
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: Owen O'Malley
>              Labels: newbie
>
> It would be very useful to have a command that takes an HDFS directory, uses fsck to find the block locations of the data files in that directory, groups them by host, and displays the distribution graphically. We did this by hand, and it was very useful for finding a skewed distribution that was causing performance problems. The tool should also be able to group by rack ID and generate a similar plot.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
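
The grouping step described in the issue could be sketched roughly as follows. This is only an illustration, not the tool the issue asks for: it assumes the text comes from `hdfs fsck <dir> -files -blocks -locations`, that each block line contains `blk_` followed by a bracketed list of replica locations written as `ip:port`, and that a text bar chart is an acceptable stand-in for a plot. The exact fsck output format varies between Hadoop versions, and the names `count_blocks_by_host`, `render_histogram`, and `SAMPLE` are hypothetical, as is the sample data.

```python
import re
from collections import Counter

# Assumption: replica locations are printed as "ip:port" inside the
# bracketed list at the end of each block line.
_LOCATION = re.compile(r"(\d{1,3}(?:\.\d{1,3}){3}):\d+")


def count_blocks_by_host(fsck_output: str) -> Counter:
    """Count block replicas per datanode IP in fsck-style text."""
    counts = Counter()
    for line in fsck_output.splitlines():
        # Only block lines carry replica locations.
        if "blk_" not in line or "[" not in line:
            continue
        # Look only at the bracketed location list, so that an IP
        # embedded in the block pool ID is never counted.
        locations = line[line.index("["):]
        for host in _LOCATION.findall(locations):
            counts[host] += 1
    return counts


def render_histogram(counts: Counter, width: int = 40) -> str:
    """Render one bar per host, busiest host first."""
    if not counts:
        return ""
    peak = max(counts.values())
    rows = []
    for host, n in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        bar = "#" * max(1, round(n * width / peak))
        rows.append(f"{host:<15} {bar} {n}")
    return "\n".join(rows)


# Made-up sample in the assumed format, for illustration only.
SAMPLE = """\
/data/part-00000 268435456 bytes, 2 block(s):  OK
0. BP-1-10.0.0.1-1:blk_1001_1 len=134217728 repl=2 [10.0.0.1:50010, 10.0.0.2:50010]
1. BP-1-10.0.0.1-1:blk_1002_1 len=134217728 repl=2 [10.0.0.1:50010, 10.0.0.3:50010]
"""

if __name__ == "__main__":
    print(render_histogram(count_blocks_by_host(SAMPLE)))
```

A host whose bar is much longer than the others would be a candidate for the kind of skew the issue mentions. Grouping by rack would need the `-racks` option of fsck and a pattern that also captures the rack path; that is not shown here.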