forrest-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Nicola Ken Barozzi <nicola...@apache.org>
Subject Re: Link Crawling?
Date Mon, 04 Nov 2002 15:36:11 GMT

Vadim Gritsenko wrote:
> Nicola Ken Barozzi wrote:
>>
>> Peter Donald wrote:
>> > Hi,
>> >
>> > Is there anyway I can add more strategies for link crawling during CLI
>> > operation? In particular I have a css sheet that has
>> >
>> > @import url("blah.css");
>> >
>> > but this wont ever be copied across because it is not crawled.
>> >
>> > Suggestions?
>>
>> Basically, the whole Cocoon CLI system has been hacked away by Stefano
>> and also Gianugo, and not much touched since then.
>>
>> It has been neglected for long, and as you know too well from the use
>> you made on Avalon site, it stopped at every single problem with links
>> it had, which BTW has never been the intention of the original writers.
>>
>> Lately I have tweaked it to output better info to the user and not to
>> break on broken links.
>> It still needs more work though.
>>
>> For now you have two options: include that link in the html as an
>> attribute to a tag (try <!-- <a href="blah.css"/> --> ) or patch the
>> Cocoon CLI which is Main.java and many other classes.
> 
> Actually, whole link extraction logic is in LinkSerializer and its parents.

Actually there is some link processing in Main.java, look at

   public Collection processURI(String uri) throws Exception {...

to see what I mean.

-- 
Nicola Ken Barozzi                   nicolaken@apache.org
             - verba volant, scripta manent -
    (discussions get forgotten, just code remains)
---------------------------------------------------------------------


Mime
View raw message