lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From eou <...@free.fr>
Subject CHE DONG
Date Fri, 19 Oct 2001 10:07:10 GMT
Hello

if you wanna indexing digit and letter mixed words just change LowerCaseTokinzer.java in com/lucene/analysis/
line 62:
      if (Character.isLetter(c)) { 
=>   if (Character.isLetterOrDigit(c)) { 
then the digit and letter mixed words, like "U2", "fifa98" will tokened and indexed as one
word.
I think it should be default for there too much words in the world now: like "fifa98", telphone
number etc.


Che Dong  



Mime
View raw message