Return-Path: Delivered-To: apmail-repository-archive@www.apache.org Received: (qmail 79552 invoked from network); 4 Mar 2011 10:51:38 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 4 Mar 2011 10:51:38 -0000 Received: (qmail 64909 invoked by uid 500); 4 Mar 2011 10:51:38 -0000 Delivered-To: apmail-repository-archive@apache.org Received: (qmail 64816 invoked by uid 500); 4 Mar 2011 10:51:38 -0000 Mailing-List: contact repository-help@apache.org; run by ezmlm Precedence: bulk list-help: list-unsubscribe: List-Post: Reply-To: repository@apache.org List-Id: Delivered-To: mailing list repository@apache.org Received: (qmail 64755 invoked by uid 99); 4 Mar 2011 10:51:38 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Mar 2011 10:51:38 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of fmeschbe@gmail.com designates 209.85.161.178 as permitted sender) Received: from [209.85.161.178] (HELO mail-gx0-f178.google.com) (209.85.161.178) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Mar 2011 10:51:29 +0000 Received: by gxk25 with SMTP id 25so901429gxk.23 for ; Fri, 04 Mar 2011 02:51:08 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:subject:from:to:in-reply-to:references :content-type:date:message-id:mime-version:x-mailer :content-transfer-encoding; bh=CA8yjf0EBkcbpwakEt4dVxWOqLJ5tSkQUucubejSAKc=; b=gCyKDKHNhGn8+qTnrKmlkU3seViaSxK3d+vDjKojg3fOw0lprvFUbYtjThCebpRBgK XVjLUgIz6teDqHj2Fn3Vdw/55dQmDqtMttGc6HBkabHngBlsg3gmEzPxLLqRJidaU51b ut88w7BXP6q0GaptD+RpWaDFLpo6nOpvuCTq4= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=subject:from:to:in-reply-to:references:content-type:date:message-id :mime-version:x-mailer:content-transfer-encoding; b=R0xyFfgMQOw7OFnJfMT1fwc4CZQveZmv9bciQa5p9ZXmN2R+4QWK905qzTfvGpwlzj kLHhpxNa4VFzUfau1AweNbD19hQepKuMBXKrvdFRs2/ehCMfIy1hxDIxHxHaau6CbPAp c5nJvDnbOW8BuEjQS7q3iQBNozBOng6tGZdpQ= Received: by 10.91.46.17 with SMTP id y17mr506338agj.182.1299234313823; Fri, 04 Mar 2011 02:25:13 -0800 (PST) Received: from [192.168.1.20] (cable-static-182-112.eblcom.ch [87.102.182.112]) by mx.google.com with ESMTPS id c7sm2766982ana.37.2011.03.04.02.25.12 (version=SSLv3 cipher=OTHER); Fri, 04 Mar 2011 02:25:13 -0800 (PST) Subject: Re: Changes on repository.apache.org? From: Felix Meschberger To: repository@apache.org In-Reply-To: References: <4D70936A.2050208@apache.org> <1299232934.2691.2.camel@meschbix> Content-Type: text/plain; charset="UTF-8" Date: Fri, 04 Mar 2011 11:25:10 +0100 Message-ID: <1299234310.2691.3.camel@meschbix> Mime-Version: 1.0 X-Mailer: Evolution 2.30.3 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Hi, Am Freitag, den 04.03.2011, 10:11 +0000 schrieb Stuart McCulloch: > On 4 March 2011 10:02, Felix Meschberger wrote: > Hi, > > Some more background: These scripts use wget to download the > release > candidate. According to the wget man page wget respects > robots.txt > > > FWIW you could add the following line to your local ~/.wgetrc > > > robots=off > > > this tells wget to ignore robots.txt - the script should then work Thanks for the hint. Carsten found this out, too, and it works indeed. Regards Felix > > which in turn contains: > > > User-agent: * > > Disallow: /content/ > > Disallow: /service/ > > Allow: / > > Allow: /content/sites/ > > Could it be that this prevents wget from working and that > robots.txt has > recently been changed (IIRC I could get a RC with the scripts > on > Monday). > > Thanks and Regards > Felix > > > Am Freitag, den 04.03.2011, 08:23 +0100 schrieb Carsten > Ziegeler: > > > Hi, > > > > in the Felix and Sling project we use a script to download > artifacts > > from the staging repository to verify the releases. > > It stopped working at some point this week. > > > > The script uses wget and fetches index.html and traverses > the links of > > this html page recursively. It seems that now index.html is > not > > available anymore. > > > > While > > > https://repository.apache.org/content/repositories/orgapachefelix-003/org/apache/felix/ > > returns the html > > > https://repository.apache.org/content/repositories/orgapachefelix-003/org/apache/felix/index.html > > > > does not. > > > > Is anyone aware of any changes here? Can we restore the old > behaviour? > > Or does someone know how to instruct wget to not append > index.html (I > > couldn't figure it out) > > > > Regards > > Carsten > > > > > > > -- > Cheers, Stuart