www-infrastructure-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Henk Penning (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (INFRA-10859) Allow rsync to list entire contents of dist area
Date Thu, 14 Apr 2016 06:40:25 GMT

    [ https://issues.apache.org/jira/browse/INFRA-10859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15240696#comment-15240696

Henk Penning commented on INFRA-10859:

... beats me ; sort of manifest, I think.

It is not unusual to publish an inventory in "ls-lR.gz" or "find-ls.gz" ; it can go into "/dist/zzz".

It would be easy to add to ~apmirror/runmirmon.sh something like :

( cd /dist ; find . \(  -name .svn -prune \) -o -ls | gzip -v -c > zzz/find-ls.gz )

90.0 % compression ; size 0.5 MB ; runtime 0.3 sec.

> Allow rsync to list entire contents of dist area
> ------------------------------------------------
>                 Key: INFRA-10859
>                 URL: https://issues.apache.org/jira/browse/INFRA-10859
>             Project: Infrastructure
>          Issue Type: Improvement
>          Components: Mirrors, Website
>         Environment: ASF mirror hosts
>            Reporter: Sebb
> The ASF mirror hosts has an rsync module which allows most of the contents of the /www/www.apache.org/dist
area to be listed. However some files such as hashes, sigs, and the KEYS files are intentionally
excluded from the module.
> It would be useful to be able to list all the contents of the directory structure.
> It is obviously possible to scrape the web pages, but that is time-consuming, fragile
and uses more resources. Likewise, using SVN to list the folder structure is time consuming,
and it does not show the actual contents which will generally be different (e.g. the .revision
files are not in SVN)
> There is a module that allows listing all the files on minotaur, but it looks like the
module has either not been configured or it disallows listing on the ASF mirror hosts.
> Note: the incubator clutch tool currently relies on being able to list the incubator
files, including the KEYS and sig files. These don't appear in the standard rsync listings,
so the code relies on using find as a cron job on minotaur.

This message was sent by Atlassian JIRA

View raw message