jackrabbit-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jukka Zitting" <jukka.zitt...@gmail.com>
Subject Re: FullText Search
Date Tue, 24 Jul 2007 07:56:29 GMT

On 7/24/07, Ishai Borovoy <ishaiborovoy@gmail.com> wrote:
> Is it possible to perform full text search on unstructured node type that
> contains binary file/s (e.g.: word,pdf,excel)?

See https://issues.apache.org/jira/browse/JCR-729 for a related feature request.

Currently Jackrabbit only indexes binary "jcr:data" properties that
have a sibling "jcr:mimeType" property that indicates the relevant
mime type.

There is currently no active effort to implement JCR-729, I guess we
will do that once the incubating Tika project
(http://incubator.apache.org/tika/) or some other project comes up
with a generic library that allows us to avoid having to deal with all
the complexities of automatic mime type detection and various
different parser libraries.


Jukka Zitting

View raw message