cocoon-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Bernhard Huber <>
Subject Re: Crawler/Indexer redesign
Date Sat, 02 Feb 2002 21:26:38 GMT

>How about 
>  Collection crawl(Source)
>? Then crawler can be ThreadSafe.
Yes, it would be ThreadSafe, storing all crawled resources in the 
Does this work for crawling huge sites?

My idea was to handle that problem by introducing the Iterator.
Using Iterator might allow to process some crawled resources quite early.
Using collection might delay the processing of the crawled resources 
until the crawling has terminated,
that might take quite some time.

Hence it might be better:
Iterator crawl( Source)

bye bernhard

To unsubscribe, e-mail:
For additional commands, email:

View raw message