manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "vigi (JIRA)" <>
Subject [jira] [Commented] (CONNECTORS-1204) Import original document file size into Solr
Date Fri, 29 May 2015 08:58:17 GMT


vigi commented on CONNECTORS-1204:

I was able to do it with a Tika metadata transformer. I added a field mapping from 'stream_size'
to my custom Solr field. It appears to get the correct file size.

Btw, where could I find all the available Tika metadata fields, it is not very clear to me.

> Import original document file size into Solr
> --------------------------------------------
>                 Key: CONNECTORS-1204
>                 URL:
>             Project: ManifoldCF
>          Issue Type: New Feature
>          Components: Framework agents process, JCIFS connector, Lucene/SOLR connector
>    Affects Versions: ManifoldCF 2.0.2
>            Reporter: vigi
>            Assignee: Karl Wright
>            Priority: Minor
>              Labels: manifoldcf, outputconnector, solr
>             Fix For: ManifoldCF 1.10, ManifoldCF 2.2
> When using the Solr output connection, I would like to be able to store the original
file size (in bytes) of the indexed documents into Solr so that it could be displayed in the
search results or it could even be used for searching later on.

This message was sent by Atlassian JIRA

View raw message