Hi,
On 7/24/07, Ishai Borovoy <ishaiborovoy@gmail.com> wrote:
> Is it possible to perform full text search on unstructured node type that
> contains binary file/s (e.g.: word,pdf,excel)?
See https://issues.apache.org/jira/browse/JCR-729 for a related feature request.
Currently Jackrabbit only indexes binary "jcr:data" properties that
have a sibling "jcr:mimeType" property that indicates the relevant
mime type.
There is currently no active effort to implement JCR-729, I guess we
will do that once the incubating Tika project
(http://incubator.apache.org/tika/) or some other project comes up
with a generic library that allows us to avoid having to deal with all
the complexities of automatic mime type detection and various
different parser libraries.
BR,
Jukka Zitting
|