jackrabbit-dev mailing list archives

From Michael Wechner <michael.wech...@wyona.com>
Subject Re: Indexing of properties setProperty(String, InputStream)
Date Mon, 26 Nov 2007 10:42:56 GMT
Marcel Reutegger wrote:

> Hi Michael,
>
> These are really questions for the user list, but anyway...


Sorry about that. I will subscribe to the user mailing list now.

Cheers

Michael

>
> Michael Wechner wrote:
>
>> I am using setProperty(String, InputStream), i.e. 
>> setProperty("content", new InputStream(...)), to save XHTML 
>> and other "bigger" content.
>> I am also using the TransientRepository implementation.
>>
>> When I search with XPath, using something like //*[@content], I 
>> don't get any results, whereas properties set with 
>> setProperty(String, String) are found.
>>
>> Now I am very sure the "content" properties do exist, because I read 
>> and write to them without a problem.
>>
>> So my guess is that properties set through setProperty(String, 
>> InputStream) are not indexed by default, because they could contain 
>> any kind of data, right?
>
>
> That's correct. The JCR specification says that binary properties are 
> not indexed, basically for the reason you mentioned: it can be 
> anything...
>
>> But can I get them indexed?
>
>
> Yes, if you store the binary as an nt:resource node. This gives 
> Jackrabbit the information it needs to index the binary (MIME type 
> and encoding). Furthermore, you need to configure text extractors in 
> the configuration. 
> http://jackrabbit.apache.org/doc/components/text-extractors.html
>
>> Should I instead use
>> setProperty(String, Value, int), set the type to String, and use 
>> Value.getStream()?
>
>
> That's an alternative, but then you will get matches for tag names as 
> well, while you are probably only interested in the text between the 
> elements and the attribute values.
>
> regards
>  marcel
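
To illustrate Marcel's last point, here is a minimal sketch of the difference between indexing the raw XHTML as a String (tag names included) and indexing only the text between the elements plus the attribute values. It uses only the JDK's SAX parser and is not Jackrabbit's own text extractor. The class and method names (XhtmlTextSketch, extractText) are hypothetical, and the input is assumed to be well-formed XHTML without a DOCTYPE or named entities.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, for illustration only.
class XhtmlTextSketch {

    /** Returns character data and attribute values; element names are skipped. */
    public static String extractText(String xhtml) throws Exception {
        final StringBuilder out = new StringBuilder();
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public void startElement(String uri, String localName,
                                     String qName, Attributes atts) {
                // Keep attribute values, but never the element or attribute names.
                for (int i = 0; i < atts.getLength(); i++) {
                    out.append(atts.getValue(i)).append(' ');
                }
                out.append(' '); // element boundary separates words
            }

            @Override
            public void endElement(String uri, String localName, String qName) {
                out.append(' ');
            }

            @Override
            public void characters(char[] ch, int start, int length) {
                // Text between the elements.
                out.append(ch, start, length);
            }
        };
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xhtml)), handler);
        // Collapse the runs of whitespace introduced at element boundaries.
        return out.toString().trim().replaceAll("\\s+", " ");
    }

    public static void main(String[] args) throws Exception {
        String xhtml =
            "<html><body><p class=\"intro\">Hello <b>JCR</b></p></body></html>";
        System.out.println(extractText(xhtml)); // prints "intro Hello JCR"
    }
}
```

A full-text search over the extracted string would match "Hello" or "intro", but not "body" or "html", which the raw String property would also match.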



-- 
Michael Wechner
Wyona      -   Open Source Content Management - Yanel, Yulup
http://www.wyona.com
michael.wechner@wyona.com, michi@apache.org
+41 44 272 91 61

