jackrabbit-dev mailing list archives

From "Jukka Zitting (JIRA)" <j...@apache.org>
Subject [jira] Resolved: (JCR-1661) Extend mimetype list of text extractors
Date Wed, 25 Jun 2008 11:45:45 GMT

     [ https://issues.apache.org/jira/browse/JCR-1661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved JCR-1661.

       Resolution: Fixed
    Fix Version/s: 1.5
         Assignee: Jukka Zitting

Patch applied (with some line wraps) in revision 671515. Thanks!

PS. The official list of media types is available at http://www.iana.org/assignments/media-types/index.html
and lists application/vnd.ms-excel and application/vnd.ms-powerpoint as the proper media
types for Excel and PowerPoint documents. However, having commonly used aliases recognized
by the text extractors can only help.
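
For illustration, alias handling of this kind could be sketched roughly as below. This is only a sketch and not the code from the applied patch: the class name MediaTypeAliases and the method canonical() are hypothetical and are not part of the Jackrabbit API. The two alias strings are the ones requested in the issue, and the canonical types are the ones listed by IANA.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper for illustration only; not a Jackrabbit class.
class MediaTypeAliases {

    // Commonly seen aliases mapped to the IANA-registered media types.
    private static final Map<String, String> ALIASES = new HashMap<String, String>();

    static {
        ALIASES.put("application/excel", "application/vnd.ms-excel");
        ALIASES.put("application/powerpoint", "application/vnd.ms-powerpoint");
    }

    // Returns the registered type for a known alias, or the normalized
    // input (lower case, parameters stripped) when no alias matches.
    static String canonical(String type) {
        if (type == null) {
            return null;
        }
        String normalized = type.trim().toLowerCase(Locale.ROOT);
        // Drop parameters such as "; charset=..." before the lookup.
        int semicolon = normalized.indexOf(';');
        if (semicolon >= 0) {
            normalized = normalized.substring(0, semicolon).trim();
        }
        String registered = ALIASES.get(normalized);
        return registered != null ? registered : normalized;
    }

    public static void main(String[] args) {
        System.out.println(canonical("application/excel"));
        System.out.println(canonical("Application/PowerPoint; charset=x"));
        System.out.println(canonical("text/plain"));
    }
}
```

An extractor that only knows the registered types could pass an incoming type through canonical() before checking it, so documents stored under an alias would still be indexed.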

> Extend mimetype list of text extractors
> ---------------------------------------
>                 Key: JCR-1661
>                 URL: https://issues.apache.org/jira/browse/JCR-1661
>             Project: Jackrabbit
>          Issue Type: Improvement
>          Components: jackrabbit-text-extractors
>    Affects Versions: 1.5
>            Reporter: Markus K.
>            Assignee: Jukka Zitting
>            Priority: Minor
>             Fix For: 1.5
>         Attachments: extractor.patch
> Do you think it would be possible to extend the mimetype list of the
> MsPowerpoint and MsExcel text extractors with "application/powerpoint" and
> "application/excel"?
> It just took me half an hour to figure out why my documents didn't turn up
> in a Jackrabbit full-text search, and other users might run into the same
> problem.
> I'm not sure whether there is a standard that lists the possible default
> mimetypes, but after a quick Google search it seems to me that these two
> are not that uncommon.

