lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dario Novakovic" <dario...@hotmail.com>
Subject Re: setting encoding
Date Tue, 21 May 2002 13:10:41 GMT
actualy, there is no need to set encoding. i only need to read files using 
proper decoding and then lucene stores it index properly, so when i retrive 
docs, they are proper strings with letters with accents.

i tought it can't be so simple. the whole thing is in reading and decoding, 
lucene takes care of the rest.

thanks everybody for suggestions

dario




>From: "redpineseed" <redpineseed@telus.net>
>Reply-To: "Lucene Users List" <lucene-user@jakarta.apache.org>
>To: "Lucene Users List" <lucene-user@jakarta.apache.org>
>Subject: Re: setting encoding
>Date: Mon, 20 May 2002 13:29:58 -0700
>
>
>convert your native code to unicode (UTF16) with the following lines:
>
>File f = new File('cp1252_input');
>FileInputStream tmp = new FileInputStream(f);
>BufferedReader  brin = new BufferedReader( new InputStreamReader( tmp, 
>"CP1252"));
>String inputString = brin.readLine();
>
>not sure your code designater is CP1252, chech that out in Java Docs.
>
>
>redpineseed


_________________________________________________________________
Chat with friends online, try MSN Messenger: http://messenger.msn.com


--
To unsubscribe, e-mail:   <mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>


Mime
View raw message