hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Johannes.Lichtenberger" <Johannes.Lichtenber...@uni-konstanz.de>
Subject Re: ClassCastException
Date Thu, 07 Oct 2010 22:13:39 GMT
On 10/08/2010 12:01 AM, Ted Yu wrote:
> http://www.ibm.com/developerworks/xml/library/x-stax2.html

Yes, my approach is to parse a very big XML file (wikipedia revisions)
with StAX in my RecordReader implementation. The key is a timestamp and
the values are List<XMLEventWritable>s, because I don't want to have to
setup a new StAX-Parser in every Map, but ok, I assume the cost of
setting up StAX-Parsers is negligible, so I can write Text-values
instead of List<XMLEventWritable>-values.


View raw message