jackrabbit-users mailing list archives

From Cheng Zhang <zhangyongji...@yahoo.com>
Subject Re: search results
Date Mon, 05 Jan 2009 06:52:53 GMT
1.5



----- Original Message ----
From: Christopher M. Logan <chrismikall@gmail.com>
To: users@jackrabbit.apache.org
Sent: Sunday, January 4, 2009 9:59:16 PM
Subject: Re: search results

Cheng,
What version of Jackrabbit are you running?
-Christopher

Cheng Zhang wrote:
> Thanks, Jukka. Please find attached my version of HTMLParser.java.
>
>
> ----- Original Message ----
> From: Jukka Zitting <jukka.zitting@gmail.com>
> To: users@jackrabbit.apache.org
> Sent: Sunday, January 4, 2009 1:48:10 AM
> Subject: Re: search results
>
> Hi,
>
> On Sun, Jan 4, 2009 at 3:08 AM, Cheng Zhang <zhangyongjiang@yahoo.com> wrote:
>  
>> It turns out that org.apache.jackrabbit.extractor.HTMLParser eats all digits:
>> in the filterAndJoin method, all non-letters are removed.
>> Does anybody have any idea why we do this? IMO, indexing "hf100" makes more
>> sense than indexing "hf".
>>    
>
> I don't recall any specific reason why digits should be dropped. I'd
> be happy to apply the fix if you've already fixed this and would like
> to attach the patch to Jira.
>
>  
>> Or is there any way I can configure Jackrabbit to use my HTMLParser instead of the default?
>>    
>
> Look at the textFilterClasses parameter in the <SearchIndex/>
> configuration of your repository.xml and workspace.xml files.
>
> BR,
>
> Jukka Zitting
>  
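
To illustrate the behaviour discussed in the thread, here is a minimal sketch of a filter that keeps digits as well as letters, so that "hf100" survives as a single token. It is not the Jackrabbit source and not the attached patch: the class name KeepDigitsFilter and this body of filterAndJoin are assumptions made for illustration only, and the real method in org.apache.jackrabbit.extractor.HTMLParser may be structured differently.

```java
public class KeepDigitsFilter {

    // Hypothetical stand-in for the filterAndJoin step described in the thread.
    // Keeps letters AND digits; every run of other characters becomes one space.
    static String filterAndJoin(String text) {
        StringBuilder out = new StringBuilder();
        boolean pendingSpace = false;
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetterOrDigit(c)) {
                // Emit a single separator before a new token, but never at the start.
                if (pendingSpace && out.length() > 0) {
                    out.append(' ');
                }
                out.append(c);
                pendingSpace = false;
            } else {
                pendingSpace = true;
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // With a letters-only check, "hf100" would be reduced to "hf".
        System.out.println(filterAndJoin("hf100, model #2: ready!"));
    }
}
```

Replacing Character.isLetterOrDigit with Character.isLetter in this sketch reproduces the reported symptom, where the digits are dropped before the text reaches the index.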
