jackrabbit-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jukka Zitting <jukka.zitt...@gmail.com>
Subject Using Tika to detect MIME types for WebDAV
Date Mon, 28 Sep 2009 09:04:54 GMT

Our WebDAV server currently uses the simple
o.a.j.server.io.MimeResolver class for automatically inferring the
MIME media types of documents. This class can only detect types based
on file extensions and incoming type hints and the default type table
we include is not very extensive.

We could improve this situation by using the Detector [1] interface
and the related default configuration from Apache Tika. This way we
could massively extend the number of supported types and also get
content-based type detection (magic bytes) almost for free.

The Detector interface and all the related classes and configuration
entries are included in the relatively small tika-core component that
has no external dependencies, so the impact on jackrabbit-jcr-server
dependencies would be reasonable.

[1] https://svn.apache.org/repos/asf/lucene/tika/trunk/tika-core/src/main/java/org/apache/tika/detect/Detector.java


Jukka Zitting

View raw message