lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Van Nguyen" <>
Subject Question regarding URL encoding
Date Mon, 17 Jul 2006 19:40:36 GMT
I'm trying to search my index using this search phrase:  1"


That returns zero search results and throws a ParseException: Lexical error at line...  I
can see that 1" is part of that particular document by searching that same document using
a different search term.


How should the Lucene index store characters like that - and characters with accents (foreign
language: á, í, ç, etc)?  Should it be encoded in UTF-8 before and stored that way:


François or Fran%c3%a7ois

1" or 1%22

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message