hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Dmitry Sivachenko <trtrmi...@gmail.com>
Subject Re: Writing output from streaming task without dealing with key/value
Date Thu, 11 Sep 2014 07:16:03 GMT
After streaming job outputs some data to stdout, some hadoop code receives it and splits into
key/value pair before it reaches TextOutputFormat.
Can anyone point me to that piece of code please?

Thanks!

On 11 сент. 2014 г., at 0:37, Dmitry Sivachenko <trtrmitya@gmail.com> wrote:

> 
> On 10 сент. 2014 г., at 22:33, Felix Chern <idryman@gmail.com> wrote:
> 
>> Use ‘tr -s’ to stripe out tabs?
>> 
>> $ echo -e "a\t\t\tb"
>> a			b
>> 
>> $ echo -e "a\t\t\tb" | tr -s "\t"
>> a	b
>> 
> 
> There can be tabs in the input, I want to keep input lines without any modification.
> 
> Actually it is rather standard task: process lines one by one without inserting extra
characters.  There should be standard solution for it IMO.
> 


Mime
View raw message