jackrabbit-users mailing list archives

From "Jukka Zitting" <jukka.zitt...@gmail.com>
Subject Re: Lucene Analyzer
Date Sat, 16 Feb 2008 22:08:07 GMT

2008/2/15 Hamid Reza Sahlolbey <sahlolbey@gmail.com>:
> First I used StandardAnalyzer, but when I looked at the workspace index files I
> saw that it doesn't index Persian text, so I changed to
> SimpleAnalyzer. Now it seems to index Persian text correctly, but searches don't
> find it (note that the query is the same for MS Word and PDF files).

Could there be some character encoding confusion somewhere? You may
want to check that the Unicode character stream produced by the text
extractor looks valid.
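
One way to run that check is sketched below, assuming you can capture the extractor's output as a String (for example by draining the Reader it returns). The class and method names are hypothetical and use only the JDK, not any Jackrabbit or Lucene API. The sketch counts code points per Unicode block: correctly decoded Persian text should fall mostly in the ARABIC block, while a run of Latin-1 Supplement characters or U+FFFD replacement characters usually points to bytes decoded with the wrong charset. Counting Arabic presentation forms is an extra assumption on my part; some extractors may emit those instead of the base letters, and they would then not match a query typed with base letters.

```java
import java.io.IOException;
import java.io.Reader;

// Hypothetical helper: summarizes which kinds of characters appear in extracted text.
public class ExtractedTextCheck {

    /** Code point counts per category of interest. */
    static final class Summary {
        int total;             // all code points seen
        int arabic;            // Unicode ARABIC block (base letters used for Persian)
        int presentationForms; // Arabic Presentation Forms A and B
        int latin1Supplement;  // U+0080..U+00FF, typical of mis-decoded UTF-8
        int replacement;       // U+FFFD, emitted when decoding fails
    }

    /** Drains a Reader (such as a text extractor's output) into a String. */
    static String readAll(Reader reader) throws IOException {
        StringBuilder sb = new StringBuilder();
        char[] buf = new char[4096];
        int n;
        while ((n = reader.read(buf)) != -1) {
            sb.append(buf, 0, n);
        }
        return sb.toString();
    }

    /** Classifies every code point of the text by Unicode block. */
    static Summary summarize(String text) {
        Summary s = new Summary();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            s.total++;
            Character.UnicodeBlock block = Character.UnicodeBlock.of(cp);
            if (cp == 0xFFFD) {
                s.replacement++;
            } else if (block == Character.UnicodeBlock.ARABIC) {
                s.arabic++;
            } else if (block == Character.UnicodeBlock.ARABIC_PRESENTATION_FORMS_A
                    || block == Character.UnicodeBlock.ARABIC_PRESENTATION_FORMS_B) {
                s.presentationForms++;
            } else if (block == Character.UnicodeBlock.LATIN_1_SUPPLEMENT) {
                s.latin1Supplement++;
            }
        }
        return s;
    }
}
```

Running `summarize(readAll(reader))` on the output for the same Persian document in Word and PDF form, and comparing the counts, should show whether the two extraction paths produce the same kind of characters.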


Jukka Zitting
