streams-dev mailing list archives

From "Steve Blackmon (JIRA)" <>
Subject [jira] [Created] (STREAMS-345) LinkCrawler in streams-processor-urls
Date Wed, 01 Jul 2015 01:31:05 GMT
Steve Blackmon created STREAMS-345:

             Summary: LinkCrawler in streams-processor-urls
                 Key: STREAMS-345
             Project: Streams
          Issue Type: Improvement
            Reporter: Steve Blackmon
            Assignee: Steve Blackmon

LinkResolverProcessor can follow links through redirects, tracking status codes and other
metadata, but does not save the content of the page.

Add a processor to the streams-processor-urls module that retrieves and saves the content
of the web pages referenced in the links field or in activity object url fields.
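
As a rough illustration of the requested behavior, and not the actual Streams implementation, the Java sketch below collects URLs from a map-shaped activity and stores the fetched page content per URL. The class name LinkCrawlerSketch, the map layout ("links" as a list of strings, "object" holding a "url" string), and the pluggable fetcher are all assumptions made for this example; httpFetch uses the JDK's java.net.http client (Java 11+) rather than any Streams API.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

// Hypothetical sketch; none of these names come from the Streams codebase.
public class LinkCrawlerSketch {

    // Gathers URLs from the "links" list and from object.url (assumed layout).
    public static List<String> collectUrls(Map<String, Object> activity) {
        List<String> urls = new ArrayList<>();
        Object links = activity.get("links");
        if (links instanceof List<?>) {
            for (Object link : (List<?>) links) {
                if (link instanceof String) {
                    urls.add((String) link);
                }
            }
        }
        Object object = activity.get("object");
        if (object instanceof Map<?, ?>) {
            Object url = ((Map<?, ?>) object).get("url");
            if (url instanceof String && !urls.contains(url)) {
                urls.add((String) url);
            }
        }
        return urls;
    }

    // Fetches every collected URL and keeps the content, keyed by URL.
    // URLs for which the fetcher returns null are skipped.
    public static Map<String, String> crawl(Map<String, Object> activity,
                                            Function<String, String> fetcher) {
        Map<String, String> pages = new LinkedHashMap<>();
        for (String url : collectUrls(activity)) {
            String content = fetcher.apply(url);
            if (content != null) {
                pages.put(url, content);
            }
        }
        return pages;
    }

    // One possible fetcher: follows redirects and returns the body on HTTP 200,
    // or null on any other status or on failure.
    public static String httpFetch(String url) {
        try {
            HttpClient client = HttpClient.newBuilder()
                    .followRedirects(HttpClient.Redirect.NORMAL)
                    .build();
            HttpRequest request = HttpRequest.newBuilder(URI.create(url)).GET().build();
            HttpResponse<String> response =
                    client.send(request, HttpResponse.BodyHandlers.ofString());
            return response.statusCode() == 200 ? response.body() : null;
        } catch (IOException | IllegalArgumentException e) {
            return null;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return null;
        }
    }
}
```

Calling LinkCrawlerSketch.crawl(activity, LinkCrawlerSketch::httpFetch) would fetch live pages. Passing the fetcher as a parameter keeps the URL collection logic testable without network access.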

This message was sent by Atlassian JIRA
