Date: Wed, 18 Jan 2012 03:45:39 +0000 (UTC)
From: "Karl Wright (Commented) (JIRA)"
To: dev@forrest.apache.org
Reply-To: dev@forrest.apache.org
Mailing-List: contact dev-help@forrest.apache.org; run by ezmlm
Message-ID: <1495997949.51558.1326858339820.JavaMail.tomcat@hel.zones.apache.org>
In-Reply-To: <987892131.47772.1326788679769.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] [Commented] (FOR-1231) Forrest does not deal properly with UTF-8 .xml content, even with the proper XML content-type header, and generates corrupted HTML
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit

    [ https://issues.apache.org/jira/browse/FOR-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13188259#comment-13188259 ]

Karl Wright commented on FOR-1231:
----------------------------------

bq. Karl, is your system setup to use en_US-UTF-8?
bq. export LC_ALL=en_US.UTF-8
bq. export LANG=en_US.UTF-8
bq. export LANGUAGE=en_US.UTF-8

I set the equivalent Windows variables but no change in the generated code for me. So it must be something else.

> Forrest does not deal properly with UTF-8 .xml content, even with the proper XML content-type header, and generates corrupted HTML
> ----------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: FOR-1231
>                 URL: https://issues.apache.org/jira/browse/FOR-1231
>             Project: Forrest
>          Issue Type: Bug
>          Components: Internationalisation (i18n)
>    Affects Versions: 0.9, 0.10-dev
>            Reporter: Karl Wright
>            Priority: Critical
>
> We're using Forrest to generate the Apache ManifoldCF site. We've added Japanese content. The content worked fine via localhost:8888, but the generated html content does not load properly in a browser, even though the browser correctly divines that the HTML page has utf-8 encoding. It looks like many utf-8 characters in the source XML are handled correctly but some are corrupted. I've also tried the fix in FORREST-668 but this does not help. See http://incubator.apache.org/connectors and click on the tab in Japanese to see what I mean. The current source for the site can be found in: https://svn.apache.org/repos/asf/incubator/lcf/trunk/site.
> I checked out latest Forrest trunk and built and used that but there has been no improvement.

--
This message is automatically generated by JIRA.
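
Since the report says that most UTF-8 characters survive generation and only some are corrupted, one way to narrow the problem down might be to check whether the generated HTML files are still valid UTF-8 at the byte level. The following is a minimal sketch, not part of Forrest: the function names `find_utf8_errors` and `scan_site` are hypothetical, and the `build/site` path in the usage note is an assumption about where the generated pages live.

```python
from pathlib import Path


def find_utf8_errors(data: bytes):
    """Return (offset, reason) for the first invalid UTF-8 sequence, or None."""
    try:
        data.decode("utf-8", errors="strict")
    except UnicodeDecodeError as exc:
        # exc.start is the byte offset where strict decoding failed.
        return exc.start, exc.reason
    return None


def scan_site(root: str):
    """Yield (path, offset, reason) for each .html file under root that is not valid UTF-8."""
    for path in sorted(Path(root).rglob("*.html")):
        problem = find_utf8_errors(path.read_bytes())
        if problem is not None:
            yield path, problem[0], problem[1]
```

Usage might look like `for hit in scan_site("build/site"): print(hit)`. A clean result would not rule out corruption (characters can be replaced by other valid ones), but a reported offset would point at the exact bytes to compare against the source XML.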