Return-Path: Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: (qmail 62590 invoked from network); 12 Mar 2010 00:01:36 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 12 Mar 2010 00:01:36 -0000 Received: (qmail 75785 invoked by uid 500); 12 Mar 2010 00:01:01 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 75756 invoked by uid 500); 12 Mar 2010 00:01:01 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 75747 invoked by uid 99); 12 Mar 2010 00:01:01 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 12 Mar 2010 00:01:01 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 12 Mar 2010 00:01:00 +0000 Received: from brutus.apache.org (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 6B76D234C4C3 for ; Fri, 12 Mar 2010 00:00:40 +0000 (UTC) Message-ID: <976984801.215891268352040438.JavaMail.jira@brutus.apache.org> Date: Fri, 12 Mar 2010 00:00:40 +0000 (UTC) From: "Todd Lipcon (JIRA)" To: hdfs-issues@hadoop.apache.org Subject: [jira] Commented: (HDFS-1034) Enhance datanode to read data and checksum file in parallel In-Reply-To: <1244236374.208931268332467271.JavaMail.jira@brutus.apache.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HDFS-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12844285#action_12844285 ] Todd Lipcon commented on HDFS-1034: ----------------------------------- In practice I don't imagine the extra disk seek for checksums is a problem for HBase - since the checksum file is relatively small, my guess is that it stays hot in the linux buffer cache and therefore doesn't represent any disk access. Would certainly be interesting to run blktrace on a heavily loaded hbase datanode to see if this is true, though! > Enhance datanode to read data and checksum file in parallel > ----------------------------------------------------------- > > Key: HDFS-1034 > URL: https://issues.apache.org/jira/browse/HDFS-1034 > Project: Hadoop HDFS > Issue Type: Improvement > Reporter: dhruba borthakur > Assignee: dhruba borthakur > > In the current HDFS implementation, a read of a block issued to the datanode results in a disk access to the checksum file followed by a disk access to the checksum file. It would be nice to be able to do these two IOs in parallel to reduce read latency. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.