hadoop-common-user mailing list archives

From Wasim Bari <wasimb...@msn.com>
Subject XML files in HDFS
Date Thu, 30 Jul 2009 08:30:31 GMT


Hi All,

I want to store some very large XML files in HDFS and then process them using MapReduce.


Is there a utility that uploads XML files to HDFS while making sure that the split of the
file into blocks doesn't break an element (i.e. half of an element in one block and the other half in another)?
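
To make the concern concrete, here is a minimal sketch in plain Python. It is not a Hadoop API; the function names, the `rec` tag, and the tiny block size are all made up for illustration. `fixed_blocks` shows how cutting at a fixed byte offset leaves half an element in one block. `records_for_split` shows one way this could be handled at read time instead of at upload time: each split emits only the elements whose opening tag starts inside it, and reads past its own end to finish the last one. It assumes the opening tag has no attributes and that elements of that tag are not nested.

```python
def fixed_blocks(data: bytes, block_size: int) -> list:
    """Cut data into consecutive blocks of block_size bytes, ignoring element boundaries."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def records_for_split(data: bytes, start: int, end: int, tag: bytes) -> list:
    """Return each <tag>...</tag> element whose opening tag begins in [start, end).

    The scan may run past `end` to complete the last element, so every
    element is produced by exactly one split. Assumes no attributes on the
    opening tag and no nesting of the same tag (illustrative simplification).
    """
    open_tag = b"<" + tag + b">"
    close_tag = b"</" + tag + b">"
    records = []
    pos = start
    while True:
        begin = data.find(open_tag, pos)
        if begin == -1 or begin >= end:
            break  # no further element starts inside this split
        finish = data.find(close_tag, begin)
        if finish == -1:
            break  # unterminated element; nothing complete to emit
        finish += len(close_tag)
        records.append(data[begin:finish])
        pos = finish
    return records


if __name__ == "__main__":
    data = b"<docs><rec>alpha</rec><rec>beta</rec><rec>gamma</rec></docs>"
    block_size = 20  # hypothetical, far smaller than a real HDFS block

    for block in fixed_blocks(data, block_size):
        print("block:", block)

    for start in range(0, len(data), block_size):
        end = min(start + block_size, len(data))
        print("split", (start, end), records_for_split(data, start, end, b"rec"))
```

With this kind of reader the file could be uploaded unchanged, since no element is lost or duplicated even though the blocks themselves cut through elements.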


Any suggestions on how to work this out would be greatly appreciated.




