commons-user mailing list archives

From Otis Gospodnetic
Subject [Digester] HTML entity decoding?
Date Wed, 15 Apr 2009 22:06:28 GMT


I'm using Digester 2.0 to process XML that may include HTML entities,
and I'd like Digester to decode them when parsing.

For example, my XML contains:

Currently, Digester parses this as:  Gr&uuml;ber

But what I am really after is "Grüber", so I am looking for a way to get this &uuml;
entity decoded by Digester.
How do I tell Digester to decode HTML entities?

Also, if I don't use CDATA, like this:

Digester gives me: Grber
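
To make the two cases concrete, here is a minimal sketch that uses only the JDK's DOM parser rather than Digester. The element name `name`, the class `EntityDemo`, the helper `rootText`, and the inline DOCTYPE are all hypothetical, chosen for illustration. It shows standard XML behavior: text inside a CDATA section is never entity-decoded, while an entity declared in the document's DTD is expanded by the parser. Digester delegates parsing to a SAX parser, so the same rules should apply there, but that is an assumption this sketch does not verify.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

public class EntityDemo {

    // Hypothetical helper: parse an XML string and return the root element's text.
    public static String rootText(String xml) throws Exception {
        DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(xml)));
        return doc.getDocumentElement().getTextContent();
    }

    public static void main(String[] args) throws Exception {
        // Case 1: CDATA content is literal, so the text stays "Gr&uuml;ber".
        String cdata = "<name><![CDATA[Gr&uuml;ber]]></name>";

        // Case 2: uuml is not one of XML's five predefined entities, so it is
        // declared here in an internal DTD subset (U+00FC is u with diaeresis).
        // The parser then expands &uuml; to that character.
        String declared =
                "<!DOCTYPE name [<!ENTITY uuml \"&#252;\">]>"
                + "<name>Gr&uuml;ber</name>";

        System.out.println(rootText(cdata));
        System.out.println(rootText(declared));
    }
}
```

If the XML cannot be changed to declare the entities, the alternative would be to decode the string after parsing, in application code, since the parser itself has no knowledge of HTML's entity set.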

Any help would be much appreciated.  Thanks,

Sematext -- Lucene - Solr - Nutch
