httpd-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Brian Behlendorf <br...@collab.net>
Subject Re: recursive robot queries
Date Tue, 02 Jan 2001 02:10:46 GMT
On Sun, 31 Dec 2000, Roy T. Fielding wrote:
> > I narrowed it down to this sequence of accesses from that host:
> > 
> > httpd.apache.org 210.73.88.163 - - [31/Dec/2000:08:07:15 -0800] "GET /docs/misc/known_client_problems.html
HTTP/1.0" 200 13973 "http://httpd.apache.org/docs/misc/compat_notes.html" "Wget/1.5.3"
> > www.apache.org 210.73.88.163 - - [31/Dec/2000:08:07:25 -0800] "GET /index/full/4118
HTTP/1.0" 200 3785 "http://httpd.apache.org/docs/misc/known_client_problems.html" "Wget/1.5.3"
> > www.apache.org 210.73.88.163 - - [31/Dec/2000:08:07:26 -0800] "GET /index/full/foundation/images/asf_logo.gif
HTTP/1.0" 200 3785 "http://www.apache.org:80/index/full/4118" "Wget/1.5.3"
> 
> I don't think so -- the presence of www.apache.org:80 would seem to indicate
> that something on our side did a redirect using the default hostname instead
> of using bugs.apache.org.  

I am pretty sure I didn't skip a request, though; the chain pretty clearly
went

http://httpd.apache.org/docs/misc/compat_notes.html
http://httpd.apache.org/docs/misc/known_client_problems.html
http://www.apache.org:80/index/full/4118

There are intervening HEAD requests for each resource right before a
GET; hmm.  Does this look right?  Could this be confusing Wget?

[taz] 6:08pm primary > telnet bugs.apache.org 80
Trying 64.208.42.41...
Connected to bugs.apache.org.
Escape character is '^]'.
HEAD /index/full/4118 HTTP/1.0
Host: bugs.apache.org

HTTP/1.1 302 Found
Date: Tue, 02 Jan 2001 02:08:48 GMT
Server: Apache/1.3.15-dev (Unix) tomcat/1.0
Location: http://bugs.apache.org/index.cgi/full/4118
Connection: close
Content-Type: text/html; charset=iso-8859-1
Connection closed by foreign host.



	Brian




Mime
View raw message