lucene-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Itziar Cortes <itz...@eleka.net>
Subject Problem indexin accented characters.
Date Sun, 20 Jun 2010 09:20:32 GMT
Hi all!

I have a little problem with CLucene when I try to index accented
characters. I need index characters like ñ, è, ü, or ó. I use Luke to see
the indexed data.

I tried this, and I had no problem:

 pDoc->add(*new Field(_T("field"), _T("a b ñ c d"), Field::STORE_YES |
Field::INDEX_TOKENIZED));


The problem begins when I tried read from a file, and index each line. For
example,

 wifstream file;
 wstring lineread;
 while(std::getline(file, lineread)){
      pDoc->add(*new Field(_T("testua"), lineread.c_str(), Field::STORE_YES
| Field::INDEX_TOKENIZED));

It only index "a" and "b".


How can I solve this problem?

Thanks in advance,

Best regards,

--
Itziar

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message