Mailing-List: contact general-help@lucene.apache.org; run by ezmlm
Precedence: bulk
Reply-To: general@lucene.apache.org
Received-SPF: neutral (athena.apache.org: local policy)
MIME-Version: 1.0
In-Reply-To: <3F3DB3D86C7846D994C93437EDE6BB88@hp6690ej01>
References: <AANLkTil9AcgtfUx_c_Su1VjhavUmPFWDhFdRkfsfolPk@mail.gmail.com>
	<3F3DB3D86C7846D994C93437EDE6BB88@hp6690ej01>
Date: Mon, 21 Jun 2010 08:05:41 +0200
Message-ID: <AANLkTimKBaNI7b6TPXBrE99EURFokfkyARXQE0iRzbto@mail.gmail.com>
Subject: Re: Problem indexing accented characters.
From: Itziar Cortes <itziar@eleka.net>
To: general@lucene.apache.org
Cc: clucene-developers@lists.sourceforge.net
Content-Type: multipart/alternative; boundary=0015175888962d8448048984188c

--0015175888962d8448048984188c
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

Hi!

Thanks for the reply.

I suspected it could be an encoding problem... but I am sure that the
file is being read correctly.

In general, the problem appears whenever I try to index the contents of a
variable.

Could you tell me where I can post this question to the CLucene user group?
Is it a mailing list?

Thanks in advance,

--
Itziar

2010/6/20 Itamar Syn-Hershko <itamar@code972.com>

> Looks like an encoding issue. Is the file being read correctly (check with
> your debugger)?
>
> Also, please post such questions to the CLucene user group.
>
> Itamar.
>
> > -----Original Message-----
> > From: Itziar Cortes [mailto:itziar@eleka.net]
> > Sent: Sunday, June 20, 2010 12:21 PM
> > To: general@lucene.apache.org
> > Subject: Problem indexing accented characters.
> >
> > Hi all!
> >
> > I have a little problem with CLucene when I try to index
> > accented characters. I need to index characters like ñ, è, ü, or
> > ó. I use Luke to see the indexed data.
> >
> > I tried this, and I had no problem:
> >
> >  pDoc->add(*new Field(_T("field"), _T("a b ñ c d"),
> > Field::STORE_YES | Field::INDEX_TOKENIZED));
> >
> >
> > The problem begins when I try to read from a file and index
> > each line. For example,
> >
> >  wifstream file;
> >  wstring lineread;
> >  while(std::getline(file, lineread)){
> >       pDoc->add(*new Field(_T("testua"), lineread.c_str(),
> >           Field::STORE_YES | Field::INDEX_TOKENIZED));
> >  }
> >
> > It only indexes "a" and "b".
> >
> >
> > How can I solve this problem?
> >
> > Thanks in advance,
> >
> > Best regards,
> >
> > --
> > Itziar
> >
>
>
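
One possible cause of the behaviour described above, which this thread does
not confirm, is that a wifstream left in the default "C" locale may stop
converting input at the first non-ASCII byte, so only the text before the
accented character is read. The sketch below sidesteps locale conversion by
reading raw bytes and widening them by hand. It assumes the file is encoded
as ISO-8859-1 (a UTF-8 file would need a real decoder). The helper names
latin1_to_wide and read_latin1_lines are made up for illustration and are not
part of CLucene.

```cpp
#include <fstream>
#include <string>
#include <vector>

// Widen ISO-8859-1 bytes to wchar_t. Latin-1 byte values are equal to the
// corresponding Unicode code points, so going through unsigned char suffices.
// ASSUMPTION: the input really is ISO-8859-1; this is wrong for UTF-8 input.
std::wstring latin1_to_wide(const std::string& bytes) {
    std::wstring out;
    out.reserve(bytes.size());
    for (unsigned char c : bytes) {
        out.push_back(static_cast<wchar_t>(c));
    }
    return out;
}

// Read a file line by line as raw bytes (no locale conversion involved)
// and return each line widened to a std::wstring.
std::vector<std::wstring> read_latin1_lines(const std::string& path) {
    std::ifstream file(path, std::ios::binary);
    std::vector<std::wstring> lines;
    std::string raw;
    while (std::getline(file, raw)) {
        // Drop a trailing carriage return left over from CRLF line endings.
        if (!raw.empty() && raw.back() == '\r') {
            raw.pop_back();
        }
        lines.push_back(latin1_to_wide(raw));
    }
    return lines;
}
```

Each returned line could then be passed to the Field constructor through
c_str(), as in the loop quoted above, assuming a build where TCHAR is
wchar_t.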

--0015175888962d8448048984188c--