httpd-bugs mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bugzi...@apache.org
Subject DO NOT REPLY [Bug 43562] - PR 21059 is not needed anymore and has degraded the use of mod_speling; please suprress
Date Fri, 05 Oct 2007 16:48:58 GMT
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG·
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=43562>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND·
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=43562





------- Additional Comments From ted@w3.org  2007-10-05 09:48 -------
One thing to add to this report is a rather serious side effect this
mod_speling/pathinfo bug inadvertently introduces, infinitely recursive uris. 
Clueless crawlers, which are many, do not notice from the ETAG that they are
getting is identical to a resource they already indexed.  When parsing a
resource for links they accessed with a trailing / and coming across a relative
uri they naturally append the relative path to the uri they already have as a
resource to crawl.

For example:

http://www.w3.org/INSTALL.html/Library/src/Library/src/ --> 200 OK

http://www.w3.org/INSTALL.html/Library/src/Library/src/Library/src/ --> 200 OK

ad nauseam

Except as perhaps a teergrube exercise to trap and hold hostage unsuspecting
crawlers indefinitely this is generally undesirable.

Regards,

-- 
Configure bugmail: http://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

---------------------------------------------------------------------
To unsubscribe, e-mail: bugs-unsubscribe@httpd.apache.org
For additional commands, e-mail: bugs-help@httpd.apache.org


Mime
View raw message