Return-Path: Delivered-To: apmail-httpd-dev-archive@www.apache.org Received: (qmail 38684 invoked from network); 11 Oct 2005 14:06:01 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 11 Oct 2005 14:06:01 -0000 Received: (qmail 13155 invoked by uid 500); 11 Oct 2005 14:05:56 -0000 Delivered-To: apmail-httpd-dev-archive@httpd.apache.org Received: (qmail 13085 invoked by uid 500); 11 Oct 2005 14:05:55 -0000 Mailing-List: contact dev-help@httpd.apache.org; run by ezmlm Precedence: bulk Reply-To: dev@httpd.apache.org list-help: list-unsubscribe: List-Post: List-Id: Delivered-To: mailing list dev@httpd.apache.org Received: (qmail 13074 invoked by uid 99); 11 Oct 2005 14:05:55 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Oct 2005 07:05:55 -0700 X-ASF-Spam-Status: No, hits=0.1 required=10.0 tests=URI_NO_WWW_ANY_CGI X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: local policy) Received: from [66.111.4.27] (HELO out3.smtp.messagingengine.com) (66.111.4.27) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Oct 2005 07:05:57 -0700 Received: from frontend1.internal (mysql-sessions.internal [10.202.2.149]) by frontend1.messagingengine.com (Postfix) with ESMTP id 3F7DACD17D3 for ; Tue, 11 Oct 2005 10:05:32 -0400 (EDT) Received: from frontend2.messagingengine.com ([10.202.2.151]) by frontend1.internal (MEProxy); Tue, 11 Oct 2005 10:05:32 -0400 X-Sasl-enc: SyzUWAc3evDHNjJ34Cd9XMeDIBtC7Mw9ke4GcQLtoVY/ 1129039531 Received: from [132.211.187.132] (unknown [132.211.187.132]) by frontend2.messagingengine.com (Postfix) with ESMTP id 87318570394 for ; Tue, 11 Oct 2005 10:05:31 -0400 (EDT) Message-ID: <434BC6AF.90906@slive.ca> Date: Tue, 11 Oct 2005 10:05:35 -0400 From: Joshua Slive User-Agent: Thunderbird 1.4 (Windows/20050908) MIME-Version: 1.0 To: dev@httpd.apache.org Subject: Re: nofollow was Re: mod_mbox References: <434AEF61.90906@slive.ca> <434B50A2.4020808@force-elite.com> <434B7B1E.6030008@webthing.com> In-Reply-To: <434B7B1E.6030008@webthing.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N Nick Kew wrote: > Paul Querna wrote: >>> 2. There are several formats for each mail message (regular, raw, >>> mime). Probably the links to everything other than the standard >>> format should use the rel="nofollow" modifier to keep the search >>> engines out. Keeping the robots off of 2/3 of the links could make >>> a big difference in load considering the number of pages on this site. >> >> >> I agree. We don't want Google and friends indexing the raw format, and >> then ranking it higher than the normal presentation. > > More importantly, any mail archive without nofollow in the messages > becomes a spam magnet. Here's some nice free googlerank for > http://dodgy.pills.example.com/?refid=yourstruly Well, we don't want to keep search engines out of the archive entirely. The archives are a huge resource that we want easily searchable. But we need to start thinking about a way to remove specific messages from our archives for this reason among others. That is more a topic for infrastructure@ Joshua.