hadoop-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Yuncong Chen <cyc3...@gmail.com>
Subject Any streaming options that specify data type of key value pairs?
Date Fri, 25 Jan 2013 02:08:10 GMT

What would be the best way in hadoop streaming to send a binary object (i.e. a python dict,
array) as the value in <key,value> pairs?

I know I can dump the object to string with pickle.dumps() and encode it to eliminate unintended
'\t','\n' before sending it to stdout, but I wonder if there are native streaming options
that specifies the data type of <key,value> pairs?

View raw message