Return-Path: Delivered-To: apmail-forrest-dev-archive@www.apache.org Received: (qmail 24446 invoked from network); 23 Jul 2005 23:53:39 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 23 Jul 2005 23:53:39 -0000 Received: (qmail 19143 invoked by uid 500); 23 Jul 2005 23:53:38 -0000 Delivered-To: apmail-forrest-dev-archive@forrest.apache.org Received: (qmail 19118 invoked by uid 500); 23 Jul 2005 23:53:38 -0000 Mailing-List: contact dev-help@forrest.apache.org; run by ezmlm Precedence: bulk list-help: list-unsubscribe: List-Post: Reply-To: dev@forrest.apache.org List-Id: Delivered-To: mailing list dev@forrest.apache.org Received: (qmail 19105 invoked by uid 99); 23 Jul 2005 23:53:38 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 23 Jul 2005 16:53:38 -0700 X-ASF-Spam-Status: No, hits=1.4 required=10.0 tests=DATE_IN_FUTURE_96_XX X-Spam-Check-By: apache.org Received-SPF: neutral (asf.osuosl.org: local policy) Received: from [217.199.181.91] (HELO ns3.wkwyw.net) (217.199.181.91) by apache.org (qpsmtpd/0.29) with SMTP; Sat, 23 Jul 2005 16:53:31 -0700 Received: (qmail 20375 invoked from network); 23 Jul 2005 23:53:34 -0000 Received: from 82-69-78-226.dsl.in-addr.zen.co.uk (HELO ?192.168.0.4?) (82.69.78.226) by ns3.wkwyw.net with SMTP; 23 Jul 2005 23:53:34 -0000 Message-ID: <430BC504.2050903@apache.org> Date: Wed, 24 Aug 2005 02:53:24 +0200 From: Ross Gardler User-Agent: Mozilla Thunderbird 1.0 (Windows/20041206) X-Accept-Language: en-us, en MIME-Version: 1.0 To: dev@forrest.apache.org Subject: Re: Add support for Googles sitemap protocol? References: <80d298bc05071311181b5c35b8@mail.gmail.com> <42D584CB.30300@apache.org> <42D5960F.3050507@apache.org> <80d298bc050713161262ca8f21@mail.gmail.com> <42D5A724.4000808@apache.org> <80d298bc0507132243420ef815@mail.gmail.com> <42D610BF.6020006@apache.org> <80d298bc05072212313281e75c@mail.gmail.com> In-Reply-To: <80d298bc05072212313281e75c@mail.gmail.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N Rasik Pandey wrote: > Hello, > > Ross wrote: > >>http://svn.apache.org/repos/asf/cocoon/trunk/src/java/org/apache/cocoon/generation/LinkStatusGenerator.java > > > > would do the trick, although it would have to be modified to make a > call > > > to get the "last-modified" header, so hopefully we could get that added > > > to a future release of cocoon. With a quick examination of the > code, it > > > looks like it will crawl a URL and generate an xml report, allowing > > > includes and excludes expressions. > > > > We can create a new generator that extands the LinkStatusGenerator and > > house it here. If Cocoon want it we will remove it from here at a later > > date. > > I have a few issues with the approach which I proposed, hopefully you > can help me make some sense about my reservations. First, since views > are orthogonal to each sitemap the cocoon-view=links required by the > LinkStatusGenerator and therefore must be declared in the parent sitemap > which does the core matching. Agghhhh!!! There's that term "views" again. We have a real problem here in Forrest at the moment. Views are being used to refer to two different things (views in Eclipse and views, the replacement for skins). Now we seem to have a third use, I assume this is sitemap views (funnily enough I said this may become a conflict in another mail earlier today, never thought it would already have happened in my inbox). I'm going to delay responding to this because one of the things that happened at the Hackathon is that Ferdinand looked at an alternative method of identifying which pages were regenerated in a run. Perhaps we should wait to see if he thinks it can be applied here. > Secondly, in building such a plugin for > forrest, the plugin-sitemap would have to override/redeclare a number of > pipeline match(s) in order to have access/provide to the necessary > request header,"last-modified", as there doesn't seem to be a generic > means for providing this information and passing it up to parent > pipeline match(s) so they are conditionally added to the request > headers. Overriding many pipelineThis doesn't seem very maintainable. > Would anyone have any ideas on how to achieve this or an alternative > approach? Internal plugins are for this kind of thing. However, as they stand they don't make the code much more maintainable. A possible solution is to use Cocoons new block sitemap loading features. This was demonstrated to me at the Hackathon (thanks Daniel), it provides a way for plugins to extend existing sitemaps (and other plugins). I need to do some experimentation with this so you should proceed with an internal plugin for now, we'll address the maintainability when experiments are complete. Ross