manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "vigi (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (CONNECTORS-1204) Import original document file size into Solr
Date Fri, 29 May 2015 09:02:18 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14564436#comment-14564436
] 

vigi edited comment on CONNECTORS-1204 at 5/29/15 9:01 AM:
-----------------------------------------------------------

I was able to do it with a Tika metadata transformer. I added a field mapping from 'stream_size'
to my custom Solr field and it seems that it gets the correct file size.

Btw, where could I find all the available Tika metadata fields, it is not very clear to me.


was (Author: vigi):
I was able to do it with a Tika metadata transformer. I added a field mapping from 'stream_size'
to my custom Solr field. It appears to get the correct file size.

Btw, where could I find all the available Tika metadata fields, it is not very clear to me.

> Import original document file size into Solr
> --------------------------------------------
>
>                 Key: CONNECTORS-1204
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1204
>             Project: ManifoldCF
>          Issue Type: New Feature
>          Components: Framework agents process, JCIFS connector, Lucene/SOLR connector
>    Affects Versions: ManifoldCF 2.0.2
>            Reporter: vigi
>            Assignee: Karl Wright
>            Priority: Minor
>              Labels: manifoldcf, outputconnector, solr
>             Fix For: ManifoldCF 1.10, ManifoldCF 2.2
>
>
> When using the Solr output connection, I would like to be able to store the original
file size (in bytes) of the indexed documents into Solr so that it could be displayed in the
search results or it could even be used for searching later on.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message