cocoon-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jeff Turner <je...@apache.org>
Subject CLI question: how are links retrieved?
Date Sat, 17 May 2003 14:45:59 GMT
Hi,

A question for those clueful about how the reworked command-line
(CocoonBean etc) works..

I gather that some speedups were made by not requesting each page 3 times
(for content, links, and something else..).  Is the 'links' view as
defined in the sitemap still used when crawling pages?

It would seem not, because I can completely delete the <map:view
name="links"..> section, or corrupt its transformer @src, and the CLI
still retrieves links from pages.

So how are the links being extracted from the page?  Sorry, I should dig
through the code but a weekend has only so many hours.

The problem I'm having with Forrest is that:

1) site: and ext: links need to be rewritten by a transformer before the
   CLI can follow them.  They are not rewritten with the new CLI, causing
   broken links.
2) the filterlinks.xsl stylesheet, used only in the 'links' view
   pipeline, is required to filter out unwanted links, and this isn't
   being called.


Perhaps as a result of 1), I get lots of these stacktraces:

java.lang.NullPointerException
        at org.apache.cocoon.environment.AbstractEnvironment.release(AbstractEnvironment.java:511)
        at org.apache.cocoon.generation.FileGenerator.recycle(FileGenerator.java:90)
        at org.apache.avalon.excalibur.pool.ResourceLimitingPool.put(ResourceLimitingPool.java:438)
        at org.apache.avalon.excalibur.component.PoolableComponentHandler.doPut(PoolableComponentHandler.java:245)
        at org.apache.avalon.excalibur.component.ComponentHandler.put(ComponentHandler.java:452)
        at org.apache.avalon.excalibur.component.ExcaliburComponentSelector.release(ExcaliburComponentSelector.java:337)
        at org.apache.cocoon.components.ExtendedComponentSelector.release(ExtendedComponentSelector.java:316)
        at org.apache.cocoon.components.ExtendedComponentSelector.release(ExtendedComponentSelector.java:313)
        at org.apache.cocoon.components.pipeline.AbstractProcessingPipeline.recycle(AbstractProcessingPipeline.java:637)
        at org.apache.cocoon.components.pipeline.impl.AbstractCachingProcessingPipeline.recycle(AbstractCachingProcessingPipeline.java:932)
        at org.apache.avalon.excalibur.pool.ResourceLimitingPool.put(ResourceLimitingPool.java:438)
        at org.apache.avalon.excalibur.component.PoolableComponentHandler.doPut(PoolableComponentHandler.java:245)
        at org.apache.avalon.excalibur.component.ComponentHandler.put(ComponentHandler.java:452)
        at org.apache.avalon.excalibur.component.ExcaliburComponentSelector.release(ExcaliburComponentSelector.java:337)
        at org.apache.cocoon.components.ExtendedComponentSelector.release(ExtendedComponentSelector.java:316)
        at org.apache.cocoon.components.EnvironmentDescription.release(CocoonComponentManager.java:496)
        at org.apache.cocoon.components.CocoonComponentManager.endProcessing(CocoonComponentManager.java:189)
        at org.apache.cocoon.Cocoon.process(Cocoon.java:648)
        at org.apache.cocoon.bean.CocoonBean.getPage(CocoonBean.java:992)
        at org.apache.cocoon.bean.CocoonBean.processURI(CocoonBean.java:728)
        at org.apache.cocoon.bean.CocoonBean.process(CocoonBean.java:524)
        at org.apache.cocoon.bean.CocoonBean.process(CocoonBean.java:472)
        at org.apache.cocoon.Main.main(Main.java:423)


Thanks,

--Jeff


Mime
View raw message