forrest-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From <greg.v...@cox.net>
Subject RE: Locating a UTF-8 sequence error?
Date Tue, 01 Feb 2005 12:56:51 GMT
Sjur --

Thanks for this tip, it worked out really well.

-- Greg

-----Original Message-----
From: Sjur Moshagen [mailto:sjurnm@mac.com] 
Sent: Sunday, January 30, 2005 9:10 AM
To: user@forrest.apache.org
Subject: Re: Locating a UTF-8 sequence error?

På 29. jan. 2005 kl. 23.27 skrev greg.vaco@cox.net:

> I'm receiving an "Invalid byte 1 of 1-byte UTF-8 sequence" error; can 
> anyone tell me how to locate the offending character(s)?

If you have xmllint installed, just type:

xmllint <FILENAME>

It will give you the exact location of the offending character. I have 
used this method myself to clean a UTF-8 file with some invalid byte 
sequenses - works excellent as long as the number of invalid chars is 
small.

Sjur



Mime
View raw message