hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From modemide <modem...@gmail.com>
Subject MultipleOutputs - Create multiple files during output
Date Thu, 01 Sep 2011 19:47:01 GMT
Hi all,
I was wondering if anyone was familiar with this class.  I want to
create multiple output files during my reduce.

My input files will consist of
<name1><action1><date1>
<name1><action2><date2>
<name1><action3><date3>

<name2><action1><date1>
<name2><action2><date2>
<name2><action3><date3>


My goal is to create files with the following format
Filename:
<name>_<Date:CCYYMM>

File Contents:
<action1>
<action2>
<action3>


I.e. This will store all the actions of one person for any given month
in one file.

I just don't know how I will decide the file name at run time.  Can anyone help?

Thanks,
Tim

Mime
View raw message