lucene-java-user mailing list archives

From Pascal Nadal <Pascal.Na...@Keyword.fr>
Subject Wildcard search and HOST tokens
Date Wed, 12 Nov 2003 10:55:23 GMT
My Lucene indexes contain fields with values like www.xxx.yyy.zzz,
which are treated as HOST tokens.
My problem is the following: wildcard and fuzzy queries never return
documents with such fields. Only searches on the full field value work.
 
Example queries: www*  www.* www.xxx* www?xxx?yyy www.yyy.y~ or just yyy
 
I'm using Lucene 1.2 and the StandardAnalyzer. It seems that the '.' is the
problem.
 
Is this a bug?
 
I wrote a HostFilter class which re-tokenizes HOST tokens, and it seems to
work fine (both full field values and wildcard queries).
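
The HostFilter source is not included in this message. As a rough
illustration of the re-tokenizing idea only, here is a standalone sketch with
no Lucene dependency. The class name HostSplitter, the Part type and the
split method are hypothetical names chosen for this example, not Lucene API.
It breaks a host value on '.' and records character offsets, which is the
information a token filter would need to emit one token per part.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper (not part of Lucene): splits a host-like value on '.'
public class HostSplitter {

    /** One sub-token plus its character offsets into the original host string. */
    public static final class Part {
        public final String text;
        public final int start; // inclusive offset
        public final int end;   // exclusive offset

        Part(String text, int start, int end) {
            this.text = text;
            this.start = start;
            this.end = end;
        }
    }

    /** Returns the non-empty dot-separated parts of host, in order. */
    public static List<Part> split(String host) {
        List<Part> parts = new ArrayList<>();
        int start = 0;
        for (int i = 0; i <= host.length(); i++) {
            // A '.' or the end of the string closes the current part.
            if (i == host.length() || host.charAt(i) == '.') {
                if (i > start) { // skip empty parts, e.g. from ".."
                    parts.add(new Part(host.substring(start, i), start, i));
                }
                start = i + 1;
            }
        }
        return parts;
    }

    public static void main(String[] args) {
        for (Part p : split("www.xxx.yyy.zzz")) {
            System.out.println(p.text + " [" + p.start + "," + p.end + ")");
        }
    }
}
```

This sketch emits only the parts. Since searches on the full field value
should keep working, a real filter would presumably also pass the original
HOST token through unchanged; that is an assumption about the design, not a
description of the actual class.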
 
