commons-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject [Digester] HTML entity decoding?
Date Wed, 15 Apr 2009 22:06:28 GMT

Hello,

I'm using Digester 2.0 and trying to process XML that
may include HTML entities and trying to get Digester to decode them
when parsing.

For example, my XML contains:
  <name><![CDATA[Gr&uuml;ber]]></name>

Currently, Digester is parses this as:  Gr&uuml;ber

But what I am really after is "GrĂ¼ber", so I am looking for a way to get this &uuml;
entity decoded by Digester.
How do I tell Digester to decode HTML entities?

Also, if I don't use CDATA, like this:
  <name>Gr&uuml;ber</name>

Digester gives me: Grber

Any help would be very appreciated.  Thanks,

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@commons.apache.org
For additional commands, e-mail: user-help@commons.apache.org


Mime
View raw message