hadoop-common-user mailing list archives

From Darren Govoni <dar...@ontrenet.com>
Subject Suitable for Hadoop?
Date Wed, 21 Jan 2009 13:07:37 GMT
I have a task to process large quantities of files by converting them
into other formats. Each file is processed as a whole and converted to a
target format. Since there are hundreds of GB of data, I thought it
suitable for Hadoop, but the problem is that I don't think the files can
be broken apart and processed in pieces. For example, how would MapReduce
convert a Word document to PDF if the file is split into blocks? I'm not
sure that's possible, or is it?
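
To make the whole-file constraint concrete, here is a minimal sketch under
stated assumptions, not a tested Hadoop job. It assumes the job input is a
text listing with one file path per line, so each map call receives a path
rather than a block of bytes and can handle that file in one piece. The
names `convert_file`, `map_paths`, and `main`, and the `.pdf` naming rule,
are hypothetical stand-ins for a real converter.

```python
import os
import sys


def convert_file(src_path):
    """Hypothetical stand-in for a real converter (e.g. Word to PDF).

    It only derives the target name. A real version would read the whole
    source file and write the converted output.
    """
    base, _ext = os.path.splitext(src_path)
    return base + ".pdf"


def map_paths(lines):
    """Treat each input record as one file path and convert that file whole."""
    for line in lines:
        path = line.strip()
        if not path:
            continue  # skip blank records
        yield path, convert_file(path)


def main(stdin=sys.stdin, stdout=sys.stdout):
    """Write tab-separated "source<TAB>target" records, one per file."""
    for src, dst in map_paths(stdin):
        stdout.write(f"{src}\t{dst}\n")
```

Calling `main()` from a script that reads paths on standard input would give
the shape of a streaming-style mapper. Because the unit of work is a path,
no document is ever cut at a block boundary, and the parallelism comes from
spreading the list of paths across map tasks.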

Thanks for any advice.
