httpd-docs mailing list archives

From Vincent Deffontaines
Subject Re: Weirdness in the documentation
Date Sat, 18 Jun 2011 21:43:17 GMT
On 18/06/2011 22:29, André Malo wrote:
> * Nick Kew wrote:
>> On 18 Jun 2011, at 20:32, Rich Bowen wrote:
>>> Today I noticed that there's some kind of weirdness with non-printable
>>> characters in the documentation. See attached screenshot.
>>> Was there a recent change to the XSLT files?
>> Doesn't look like it from a glance at the svn.  No change to the XSLT
>> since 2009. More likely the server suddenly started incorrectly sending
>> it as utf-8.
>> However, that does beg the question: shouldn't we move to utf-8?
> This begs another question: Why would we want that?

As a French translator of the Apache docs, I would love not to have to
run scripts like the one below at commit time, and the reverse script
whenever I want to edit the XML file again. If moving to UTF-8 would
spare us these scripts, that would be a good thing for us. I cannot
speak for the German, Korean, or other translators, but I suppose they
have run into similar conversion chores.

Vincent Deffontaines

Example script:
sed -e 's/À/\&Agrave;/g' \
    -e 's/Ç/\&Ccedil;/g' \
    -e 's/È/\&Egrave;/g' \
    -e 's/É/\&Eacute;/g' \
    -e 's/Ê/\&Ecirc;/g' \
    -e 's/Î/\&Icirc;/g' \
    -e 's/Ù/\&Ugrave;/g' \
    -e 's/Û/\&Ucirc;/g' \
    -e 's/à/\&agrave;/g' \
    -e 's/â/\&acirc;/g' \
    -e 's/ä/\&auml;/g' \
    -e 's/æ/\&aelig;/g' \
    -e 's/ç/\&ccedil;/g' \
    -e 's/è/\&egrave;/g' \
    -e 's/é/\&eacute;/g' \
    -e 's/ê/\&ecirc;/g' \
    -e 's/ë/\&euml;/g' \
    -e 's/ô/\&ocirc;/g' \
    -e 's/Ô/\&Ocirc;/g' \
    -e 's/î/\&icirc;/g' \
    -e 's/ï/\&iuml;/g' \
    -e 's/ù/\&ugrave;/g' \
    -e 's/û/\&ucirc;/g' \
    -e 's/ü/\&uuml;/g'
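
The revert script is not shown here. As a rough sketch only, assuming it simply mirrors the mapping above in the other direction, it could look something like the following. The function name entities_to_chars is hypothetical, only a handful of entities are listed, and the script file is assumed to be saved in the same encoding as the text being edited.

```shell
#!/bin/sh
# Hypothetical helper (not the actual revert script): turn a few HTML
# entities back into literal characters. Reads stdin, writes stdout.
# In a sed pattern "&" is an ordinary character, so no escaping is needed
# on the left-hand side; extend the list to mirror the forward script.
entities_to_chars() {
    sed -e 's/&Eacute;/É/g' \
        -e 's/&agrave;/à/g' \
        -e 's/&ccedil;/ç/g' \
        -e 's/&egrave;/è/g' \
        -e 's/&eacute;/é/g' \
        -e 's/&ecirc;/ê/g'
}

# Example use on a pipe:
#   printf 'fran&ccedil;ais\n' | entities_to_chars
```

Writing it as a filter keeps it symmetric with the forward script, so either one can be dropped into a pipeline before editing or before commit.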

>> How has infra handled multiple charsets in the past? 
> mod_mime, I believe (modulo some exceptional handling for special 
> extensions). Most certainly, it was delivered as iso-8859-1 before, if not 
> specified otherwise via file extension.
> AddDefaultCharset iso-8859-1 or even better an AddCharset iso-8859-1 .html 
> for the manual section should solve the problem.
> nd
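
For reference, the two directives suggested above might be placed roughly as follows. This is only a sketch: the directory path is a made-up placeholder, not the real location of the manual on the server, and only one of the two directives would normally be needed.

```apache
# Hypothetical path; substitute wherever the manual is actually served from.
<Directory "/path/to/manual">
    # Option 1 (core): add this charset to text/plain and text/html
    # responses that do not already declare one.
    AddDefaultCharset ISO-8859-1

    # Option 2 (mod_mime): tie the charset to the .html extension only.
    # AddCharset ISO-8859-1 .html
</Directory>
```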
