httpd-modules-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Nick Kew <>
Subject Re: How do I know the character encoding?
Date Sat, 03 May 2008 12:17:36 GMT
On Fri, 2 May 2008 13:46:16 -0700 (PDT)
John Zhang <> wrote:

> In my output filter, I need to parse the document to search for
> certain patterns.
> Where can I get the information about the (character) encoding so
> that I can parse the document correctly?  Eg the document may contain
> unicode characters and are encoded in a special encoding. 


If your filter uses libxml2, just use mod_xml2enc alongside it.
If not, you can still use the charset detection and transcoding.

Nick Kew

Application Development with Apache - the Apache Modules Book

View raw message