hadoop-mapreduce-user mailing list archives

From Allen Wittenauer ...@apache.org>
Subject Re: Emit an entire file
Date Thu, 30 Jun 2011 18:52:43 GMT

On Jun 28, 2011, at 6:19 AM, Jeremy Cunningham wrote:

> I have lots of binary files stored in hdfs.  I read them using Apache POI and can search
> with no problems. I want to be able to search for keywords (which I can do) and then copy
> the file that has the text out to a different location.  The location can be in hdfs but I
> just need a location that contains all the files that meet my criteria.

	There is an entire file system API (org.apache.hadoop.fs.FileSystem) that lets you read and
write files in HDFS. Additionally, the user specifies where the output is written when
configuring a MapReduce job.
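
	For illustration, here is a minimal sketch of the file system API route: copying one matching
file into a collection directory. It assumes the keyword search has already picked the file, that
source and destination are on the same file system, and that the Hadoop client jars are on the
classpath. The class name MatchCopier and the method copyMatch are hypothetical names made up for
this example; FileSystem, FileUtil.copy, and Path are the real Hadoop classes.

```java
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.FileUtil;
import org.apache.hadoop.fs.Path;

// Hypothetical helper, not part of Hadoop: copies a matching file into a
// directory that collects every file meeting the search criteria.
public class MatchCopier {

    /** Copies src into destDir on the same file system, keeping the file name. */
    public static Path copyMatch(FileSystem fs, Path src, Path destDir, Configuration conf)
            throws IOException {
        // Create the destination directory; this succeeds if it already exists.
        fs.mkdirs(destDir);
        Path dest = new Path(destDir, src.getName());
        // deleteSource = false, so the original file stays where it is.
        boolean ok = FileUtil.copy(fs, src, fs, dest, false, conf);
        if (!ok) {
            throw new IOException("Copy failed: " + src + " -> " + dest);
        }
        return dest;
    }

    public static void main(String[] args) throws IOException {
        // Picks up core-site.xml from the classpath if present; otherwise
        // the default file system is the local one.
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf);
        Path copied = copyMatch(fs, new Path(args[0]), new Path(args[1]), conf);
        System.out.println("Copied to " + copied);
    }
}
```

	The same call could be made from inside a mapper once a file matches, though whether that fits
depends on how the job reads its input.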

	So.... is there a specific question that you need answered?