Return-Path: X-Original-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B200CD5AD for ; Tue, 17 Jul 2012 14:15:36 +0000 (UTC) Received: (qmail 71316 invoked by uid 500); 17 Jul 2012 14:15:36 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 71269 invoked by uid 500); 17 Jul 2012 14:15:36 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 71237 invoked by uid 99); 17 Jul 2012 14:15:35 -0000 Received: from issues-vm.apache.org (HELO issues-vm) (140.211.11.160) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 17 Jul 2012 14:15:35 +0000 Received: from isssues-vm.apache.org (localhost [127.0.0.1]) by issues-vm (Postfix) with ESMTP id 2B077142856 for ; Tue, 17 Jul 2012 14:15:35 +0000 (UTC) Date: Tue, 17 Jul 2012 14:15:35 +0000 (UTC) From: "Daryn Sharp (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: <1087592776.64051.1342534535179.JavaMail.jiratomcat@issues-vm> In-Reply-To: <1987133318.59121.1342461274950.JavaMail.jiratomcat@issues-vm> Subject: [jira] [Commented] (HDFS-3671) ByteRangeInputStream shouldn't require the content length header be present MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HDFS-3671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13416233#comment-13416233 ] Daryn Sharp commented on HDFS-3671: ----------------------------------- Since any fix is a hacky workaround for very old hadoop clusters that violate the http rfc (see below), should we only increase the read timeout if a content-length is missing? Or as I suggested before, use a file stat (or simple HEAD request) if the content-length is missing? The latter will prevent a minimum 200s tail on the transfers. (An http server is required to close the connection when a transfer lacks a content-length is complete in order to avoid this ambiguity on the client side. However it's strongly advised not to leave out the header since the client doesn't know if it only received a partial download, hence my suggestion to fallback to size from a file stat.) > ByteRangeInputStream shouldn't require the content length header be present > --------------------------------------------------------------------------- > > Key: HDFS-3671 > URL: https://issues.apache.org/jira/browse/HDFS-3671 > Project: Hadoop HDFS > Issue Type: Bug > Affects Versions: 2.0.0-alpha > Reporter: Eli Collins > Priority: Blocker > > Per HDFS-3318 the content length header check breaks distcp compatibility with previous releases (0.20.2 and earlier, and 0.21). Like branch-1 this check should be lenient. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira