forrest-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ferdinand Soethe <samm...@soethe.net>
Subject Encoding-atribute in site.xml (copied from user list)
Date Tue, 10 May 2005 05:21:45 GMT

Ross Gardler wrote:

RG> Roland Becker wrote:

>> Is there e reason, that "site.xml" after "forrest seed" has
>> no encoding-atribute?
>> 
>> If there is a german umlaut in "site.xml", then build fails with:
>> 
>> linkmap.html  BROKEN: Invalid byte 2 of 2-byte UTF-8 sequence.
>> BUILD FAILED
>> C:\programme\apache-forrest-0.6\src\core\targets\site.xml:43: Java returned: 1

RG> No reason that I am aware of or can imagine. I've updated the file in
RG> site.xml for the next release.


Funny, I have been using Umlaute with 0.7 head all over my site.xml
and never had a problem compiling it. See below:

<?xml version="1.0"?>

> <site label="Bildungsverein Hannover" href="" xmlns="http://apache.org/forrest/linkmap/1.0"
tab="home">
>
>   <about label="Über uns" tab="home">
>     <über_uns label="Wir über uns" href="index.html"/>
>     <Dozentinnen label="Dozent/innen" href="dozentinnen.html"/>
>     <Lernorte label="Lernorte" href="lernorte.html"/>
>     <agbs label="Geschäftsbedingungen" href="agbs.html"/>
>     <newsletter label="Newsletter" href="newsletter.html"/>
>     <kontakt label="Kontakt" href="kontakt.html"/>
>     <impressum label="Impressum" href="impressum.html"/>
>     <wegweiser label="Wegweiser" href="Wegweiser.html"/>
>   </about>

I assumed that somebody had fixed the problem in the meantime but never
checked. Now I did and found that Eclipse understands site to be utf-8
even though there is not explicit declaration.

Perhaps we should append the issue and the FAQ entry (How to use
special characters in the labels of the site.xml file?) referring to
the problem and explain that you _can_ use special characters directly
if you use the standard utf-8 encoding for site.xml (and make sure
your editor knows it).

Still unclear to me: Am I correct that this is an _editor problem_ in
the sense that no encoding-attribute means the file is utf-8 but
Roland's editor needed a stronger hint?

Thanks,
Ferdinand Soethe


Mime
View raw message