infra-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Henk Penning (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (INFRA-15707) Spark pairs.com mirror incorrectly includes "Content-Encoding: x-gzip" in header
Date Thu, 29 Nov 2018 12:27:00 GMT

    [ https://issues.apache.org/jira/browse/INFRA-15707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16703100#comment-16703100
] 

Henk Penning commented on INFRA-15707:
--------------------------------------

Ah, I see.

I checked out the files in https://dist.apache.org/repos/dist/release/ [with --depth=files]
;
made the change :
svn diff
Index: .htaccess
===================================================================
--- .htaccess	(revision 31205)
+++ .htaccess	(working copy)
@@ -9,5 +9,5 @@
 AddDescription "Perl project" perl/
 AddDescription "Tcl project" tcl/
 AddDescription "XML project" xml/
-RemoveEncoding .gz .Z
+RemoveEncoding .gz .Z .tgz

.. but I can't commit :

Sending        .htaccess
Transmitting file data .svn: E195023: Commit failed (details follow):
svn: E195023: Changing file '/home/henkp/svn/dist/.htaccess' is forbidden by the server
svn: E175013: Access to '/repos/dist/!svn/txr/31205-qcn/release/.htaccess' forbidden
svn: E175002: Additional errors:
svn: E175002: PUT of '/repos/dist/!svn/txr/31205-qcn/release/.htaccess': 403 Forbidden


> Spark pairs.com mirror incorrectly includes "Content-Encoding: x-gzip" in header
> --------------------------------------------------------------------------------
>
>                 Key: INFRA-15707
>                 URL: https://issues.apache.org/jira/browse/INFRA-15707
>             Project: Infrastructure
>          Issue Type: Bug
>          Components: Mirrors
>            Reporter: John Brock
>            Assignee: Henk Penning
>            Priority: Major
>
> Not sure if INFRA is the right place for this.
> Browsers like Chrome (and possibly curl?) will automatically decompress anything sent
over the wire when the header has "Content-Encoding: x-gzip". This means downloads of Spark
from the pairs.com mirror get unzipped automatically, so what ends up on your machine is just
the tar (but with the original filename intact, even if it ends in .gz). This also means that
trying to validate the checksum of the download will mysteriously fail, since the posted checksums
are for the .tar.gz, not the .tar.
> See SPARK-22851 for reference.
> Relevant example:
> > curl -I http://apache.mirrors.pair.com/spark/spark-2.2.1/spark-2.2.1-bin-hadoop2.7.tgz
> HTTP/1.1 200 OK
> Date: Thu, 21 Dec 2017 23:46:58 GMT
> Server: Apache/2.2.29
> Last-Modified: Sat, 25 Nov 2017 02:44:26 GMT
> ETag: "32b662-bfa03c4-55ec5a5c358a1"
> Accept-Ranges: bytes
> Content-Length: 200934340
> Content-Type: application/x-tar
> Content-Encoding: x-gzip



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message