incubator-ooo-dev mailing list archives

From Dave Fisher <>
Subject Re: svn commit: r1336368 - /incubator/ooo/ooo-site/trunk/content/hu/brand.mdtext
Date Thu, 10 May 2012 19:06:18 GMT

On May 10, 2012, at 11:52 AM, Juan C. Sanz wrote:

> On 10/05/2012 20:36, Dave Fisher wrote:
>> On May 10, 2012, at 11:15 AM, sebb wrote:
>>> On 10 May 2012 07:17, Dennis E. Hamilton<>  wrote:
>>>> 1. You can Save as UTF8 in Windows Notepad.
>>>> 2. The BOM (Byte-Order-Mark) is a special two-byte prefix that is
>>>> usually used with UTF16 to determine whether the two bytes of each
>>>> 16-bit code are big-endian or little-endian. Some products (including
>>>> Windows Notepad) also put a BOM on the front of UTF8 streams. Some
>>>> software is hostile to it being there, other software accepts it and
>>>> recognizes the following code as UTF8 anyhow. You ran into a hostile case.
>>>> Since you don't see the BOM (it is at the front of the file but is not
>>>> treated as a character), and you are on Windows, you need a text editor
>>>> that doesn't produce BOMs and also discards any that it encounters.
>>>> I recommend an HTML editor. If you prefer raw text editing, I think
>>>> jEdit will work for you. It is a Java application, available on
>>>> SourceForge. The Windows version of gedit will also save text in UTF8,
>>>> and it probably doesn't produce a BOM. (It is difficult to know whether
>>>> one is there without looking at the beginning of the file in a hex
>>>> editor/viewer.)
>>> My copy of Notepad++ (v5.9) has the following Encoding Menu options:
>>> Encode in ANSI
>>> Encode in UTF-8 without BOM
>>> Encode in UTF-8
>>> ...
>>> Convert to ANSI
>>> Convert to UTF-8 without BOM
>>> Convert to UTF-8
>>> ...
>> Thanks! I've added a note to
> In the note you say "If you use Notepad on Windows you have several
> options for saving or converting to UTF. Choose the "UTF-8 without BOM"
> option", but this is not an option in Notepad (at least not in mine). It
> is an option in Notepad++, which is not the editor that ships with Windows.

Oops, I misread. Let's think about this a bit. Notepad is dangerous, but inevitable.

There is a perl routine: read_text_file in

# utility for parsing txt files with headers in them
# and passing the args along to a hashref (in 2nd arg)

sub read_text_file {

If that can be taught to handle the BOM then we won't care.
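
A rough sketch of what "handling the BOM" could mean, assuming the routine reads the file line by line and that only the first line can carry the mark. The helper name strip_bom and the sample data are hypothetical and are not part of the existing code. In UTF8 the BOM is the three bytes EF BB BF, which become the single character U+FEFF once the text is decoded, so the sketch removes either form:

```perl
use strict;
use warnings;

# Hypothetical helper (not in the existing code): drop a leading UTF-8
# byte order mark from a line of text.
sub strip_bom {
    my ($line) = @_;
    # Raw bytes EF BB BF, as seen when the file is read without decoding
    $line =~ s/\A\xEF\xBB\xBF//;
    # Decoded form U+FEFF, as seen through an :encoding(UTF-8) layer
    $line =~ s/\A\x{FEFF}//;
    return $line;
}

# Usage sketch with an in-memory file that starts with a BOM.
# The header line is made-up sample data.
my $data = "\xEF\xBB\xBFTitle: Brand\n\nBody text\n";
open my $fh, '<', \$data or die "Cannot open in-memory file: $!";
my @lines;
while (my $line = <$fh>) {
    # Only the first line of a file can start with the BOM
    $line = strip_bom($line) if $. == 1;
    push @lines, $line;
}
close $fh;
print @lines;
```

With something like this applied to the first line, a header at the top of a Notepad-saved file would no longer be hidden behind the invisible prefix. Whether it belongs inside read_text_file itself or in its caller is an open question.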

(I'm removing my note.)


> Regards
> Juan C.
>> Regards,
>> Dave
