manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jan Høydahl (JIRA) <j...@apache.org>
Subject [jira] [Created] (CONNECTORS-243) Web crawler must get the "Last-Modified" HTTP header and pass it as metadata to output
Date Mon, 22 Aug 2011 15:10:29 GMT
Web crawler must get the "Last-Modified" HTTP header and pass it as metadata to output
--------------------------------------------------------------------------------------

                 Key: CONNECTORS-243
                 URL: https://issues.apache.org/jira/browse/CONNECTORS-243
             Project: ManifoldCF
          Issue Type: New Feature
          Components: Web connector
    Affects Versions: ManifoldCF 0.2
            Reporter: Jan Høydahl


Last-Modified is important in web search, at it may be used for (de)boosting based on date.
In fact, ManifoldCF should have the ability to parse any (or all) HTTP headers from source
document and pass it on.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

Mime
View raw message