hadoop-mapreduce-user mailing list archives

From Manoj Babu <manoj...@gmail.com>
Subject Processing Large XML in Hadoop
Date Sun, 15 Jul 2012 16:37:42 GMT
Hi,

Could you kindly explain the pros and cons of using Hadoop's
StreamInputFormat versus Mahout's XmlInputFormat?
How does the record reader read a record that spans multiple blocks when
dealing with large XML files?

Thanks in advance.

Cheers!
Manoj.
