jackrabbit-dev mailing list archives

From "Jukka Zitting (JIRA)" <j...@apache.org>
Subject [jira] Commented: (JCR-728) Automatic MIME type detection
Date Fri, 02 Feb 2007 09:18:05 GMT

    [ https://issues.apache.org/jira/browse/JCR-728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12469696 ]

Jukka Zitting commented on JCR-728:

> Why LGPL is troublesome?

The LGPL works as intended for C code, but it is troublesome for languages like Java. See
http://wiki.apache.org/jakarta/Using_LGPL'd_code and the current draft of the third-party
license policy at http://www.apache.org/legal/3party.html for more details.

It could be possible for us to introduce a limited LGPL dependency if there's no reasonable
alternative (see the conditions on the Jakarta wiki), but I don't think jmimemagic is essential
enough to justify such trouble. This is also the reason why we can't release the Hibernate
persistence manager we currently have in the contrib directory.

> I know this is a POI issue, but is this project abandoned?

I've seen some activity there, but I don't know the exact status of the project. The latest
Jakarta board report mentioned some conflict over the status of POI, but I hope that's been
cleared up. It would be nice if we didn't have to start looking for an alternative.

> Automatic MIME type detection
> -----------------------------
>                 Key: JCR-728
>                 URL: https://issues.apache.org/jira/browse/JCR-728
>             Project: Jackrabbit
>          Issue Type: Improvement
>          Components: indexing
>            Reporter: Jukka Zitting
>            Priority: Minor
> Currently only the jcr:mimeType property is used to determine the MIME type and thus
> the applicable text extractor to use for indexing a document. If the jcr:mimeType property
> is not available or is set to a generic value like "application/octet-stream", then the indexer
> could also use some heuristics based on the node name or magic numbers within the binary stream
> to determine the type of the document.
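
The fallback described in the issue could be sketched roughly as follows. This is only an
illustration, not Jackrabbit code: the class MimeTypeGuesser, its method names, the small set
of signatures and extensions, and the choice to try magic numbers before the node name are all
assumptions made for the example.

```java
import java.nio.charset.StandardCharsets;
import java.util.Locale;

// Hypothetical helper, not part of Jackrabbit: guesses a MIME type when
// jcr:mimeType is missing or set to the generic "application/octet-stream".
class MimeTypeGuesser {

    static final String GENERIC = "application/octet-stream";

    // A few well-known file signatures (illustrative, far from complete).
    private static final byte[] PDF_MAGIC = {'%', 'P', 'D', 'F', '-'};
    private static final byte[] RTF_MAGIC = {'{', '\\', 'r', 't', 'f'};
    private static final byte[] ZIP_MAGIC = {'P', 'K', 3, 4};

    // True if data begins with the given signature bytes.
    static boolean startsWith(byte[] data, byte[] magic) {
        if (data.length < magic.length) {
            return false;
        }
        for (int i = 0; i < magic.length; i++) {
            if (data[i] != magic[i]) {
                return false;
            }
        }
        return true;
    }

    // Heuristic 1: magic numbers at the start of the binary stream.
    static String fromMagic(byte[] head) {
        if (startsWith(head, PDF_MAGIC)) return "application/pdf";
        if (startsWith(head, RTF_MAGIC)) return "application/rtf";
        if (startsWith(head, ZIP_MAGIC)) return "application/zip";
        return null;
    }

    // Heuristic 2: the extension of the node name.
    static String fromName(String name) {
        int dot = name.lastIndexOf('.');
        if (dot < 0) {
            return null;
        }
        switch (name.substring(dot + 1).toLowerCase(Locale.ROOT)) {
            case "pdf":  return "application/pdf";
            case "rtf":  return "application/rtf";
            case "txt":  return "text/plain";
            case "htm":
            case "html": return "text/html";
            case "xml":  return "application/xml";
            default:     return null;
        }
    }

    // Uses the declared type unless it is missing or generic, then falls
    // back to magic numbers, then to the node name (order is an assumption).
    static String guess(String declared, String nodeName, byte[] head) {
        if (declared != null && !declared.isEmpty() && !GENERIC.equals(declared)) {
            return declared;
        }
        String type = (head != null) ? fromMagic(head) : null;
        if (type == null && nodeName != null) {
            type = fromName(nodeName);
        }
        return (type != null) ? type : GENERIC;
    }

    public static void main(String[] args) {
        byte[] head = "%PDF-1.4".getBytes(StandardCharsets.US_ASCII);
        System.out.println(guess(GENERIC, "report.bin", head));
    }
}
```

In this sketch an explicitly set, non-generic jcr:mimeType always wins, so the heuristics only
apply in the cases the issue mentions.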

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
