manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Antonio David Pérez Morales (JIRA) <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-1071) The windows Shares connector needs some improvement in dates and other fields management
Date Tue, 14 Oct 2014 13:26:34 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14170904#comment-14170904
] 

Antonio David Pérez Morales commented on CONNECTORS-1071:
---------------------------------------------------------

Hi Shinichiro Abe ,
Can you explain me the "Path attribute name/regexp value functionality" ?
Because I am not sure I got your question properly

> The windows Shares connector needs some improvement in dates and other fields management
> ----------------------------------------------------------------------------------------
>
>                 Key: CONNECTORS-1071
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1071
>             Project: ManifoldCF
>          Issue Type: Improvement
>          Components: JCIFS connector
>    Affects Versions: ManifoldCF 1.7
>            Reporter: Antonio David Pérez Morales
>            Assignee: Karl Wright
>            Priority: Minor
>             Fix For: ManifoldCF 2.0
>
>         Attachments: CONNECTORS-1071.patch
>
>
> Right now the connector is overwriting the tika metadata "creation_date" and "last_modification_Date"
for a document. This is happening because at a Windows Shares level you have a creation_date
and a last_modification_date (related to the creation of the document in the windows shares
filesystem) that are different from the creation_date and the last_modification_date associated
to the original file.
> There is the need to change the metadata name to distinguish between this 2 layers of
dates and guaranteeing flexibility to the user to use the one that he/she wants with a proper
mapping.
> A plus can be to format the date in the lucene standard, to be aligned with a proper
standard.
> - Url metadata :
> Can be useful to extract the Url and store it in a specific metadata ( further than the
ID of the document). In this way we can keep it as Id but also use it with other mappings
without affecting the Id field.
> - Parent Directory path :
> Can be useful to extract the Path for the directory that contains the current file. Evaluate
well this as can be a redundancy or an improvement.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message