hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Gopal Vijayaraghavan <gop...@apache.org>
Subject Re: HIVE CLI does not escape \t ?
Date Thu, 21 Jan 2016 19:37:08 GMT

>I use the workaround cat * >> output.tsv but that's not ideal.
>
>Any way to constrain the number of files to 1 automatically?

I generally use an "ORDER BY 0" to insert a single reducer, which produces
exactly 1 file.

This is generally not a problem if you have say, <= 1 million rows.

HDFS allows only 1 writer per file - to get a single file, a single
reducer task has to write out the entire file.

Cheers,
Gopal



Mime
View raw message