Subject: Re: Japanese characters transform from doc to html
From: Thorsten Scherler
To: user@forrest.apache.org
Date: Tue, 17 Jun 2008 13:48:28 +0200

On Sat, 2008-06-14 at 00:32 +0900, Dr. Bhatia Praveen wrote:
> Hello,
> I use an [1] xml file -> [2] transformed to doc by xslt ->
> [3] transformed to xml by my sitemap pipeline -> [4] transformed to
> html by forrest.
> The relevant string that gets transformed in this process is as
> follows (Please note these are Japanese character strings):
> [1]
>
> Japanese string1
> Japanese string2
>
> At [3] these strings after transformation looks correctly like:
> Japanese string2
>
> At [4] in html source it looks like:
> Japanese string2
>
> Hence Japanese string1 at the href location got converted to strange
> characters...
>
> Problem: Till step [3] the transformation is correct in Japanese. At
> step [4] the href= portion's Japanese has become special
> characters while the value place Japanese is in normal Japanese. My
> sayText() library function can't interpret these characters and
> requires normal Japanese there.
>
> What can I do to get the normal Japanese at the href= location after
> the transformation to html from forrest site?

Actually that is a bit cumbersome, and I recommend not using the Japanese characters directly in the href attribute. The problem is that the href attribute gets encoded by the link rewrite.

Workaround:

> thanks
> Praveen

--
Thorsten Scherler thorsten.at.apache.org
Open Source Java consulting, training and solutions
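
The point above, that the href attribute gets encoded by the link rewrite, can be illustrated with a small sketch. It assumes the "strange characters" are percent-encoded UTF-8 bytes, which is the usual URL encoding but is not confirmed in this thread. The function names and the sample string are hypothetical and are not part of Forrest or of the sayText() library.

```python
from urllib.parse import quote, unquote


def encode_href(text: str) -> str:
    """Percent-encode non-ASCII characters as UTF-8 bytes.

    Assumption: this mimics what a URL-encoding link rewriter does to an
    href value; it is not Forrest's actual implementation.
    """
    return quote(text, safe="/:#?&=")


def decode_href(href: str) -> str:
    """Turn a percent-encoded href back into readable text."""
    return unquote(href, encoding="utf-8")


# Hypothetical sample value: the three characters of the word "Nihongo".
original = "\u65e5\u672c\u8a9e"
encoded = encode_href(original)
decoded = decode_href(encoded)

print(encoded)              # %E6%97%A5%E6%9C%AC%E8%AA%9E
print(decoded == original)  # True
```

If that assumption holds, a consumer of the generated HTML could decode the href value before using it, rather than expecting raw Japanese characters in the attribute.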