incubator-connectors-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Karl Wright (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-118) Crawled archive files should be expanded into their constituent files
Date Sat, 02 Apr 2011 12:08:06 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13015019#comment-13015019
] 

Karl Wright commented on CONNECTORS-118:
----------------------------------------

This ticket is stalled.
The driver behind it was being able to support a feature that Aperture has.  The way it would
need to be done in ManifoldCF is to have individual connectors deal with the feature.  Each
connector that supports it would know how to generate a specialized URL which referred to
the archive contents, and the document identifiers for such connectors would also need to
be changed to be able to represent archive contents as well.  The connectors under consideration
would be the file system connector, the JCIFS connector, and the Web connector.

> Crawled archive files should be expanded into their constituent files
> ---------------------------------------------------------------------
>
>                 Key: CONNECTORS-118
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-118
>             Project: ManifoldCF
>          Issue Type: New Feature
>          Components: Framework crawler agent
>            Reporter: Jack Krupansky
>
> Archive files such as zip, mbox, tar, etc. should be expanded into their constituent
files during crawling of repositories so that any output connector would output the flattened
archive.
> This could be an option, defaulted to ON, since someone may want to implement a "copy"
connector that maintains crawled files as-is.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message