hive-user mailing list archives

From Ankit Bhatnagar
Subject Re: HCatInputFormat combine splits
Date Thu, 14 May 2015 17:27:26 GMT
You can explicitly set the split size.
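
To illustrate the suggestion above, here is a minimal sketch of how a larger minimum split size reduces the number of splits. It reproduces the rule used by Hadoop's `org.apache.hadoop.mapreduce.lib.input.FileInputFormat`, namely splitSize = max(minSize, min(maxSize, blockSize)), where minSize and maxSize come from the job properties `mapreduce.input.fileinputformat.split.minsize` and `mapreduce.input.fileinputformat.split.maxsize`. The class name, the 128 MiB block size, and the 10 GiB file are hypothetical, and it is an assumption that the storage format behind HCatInputFormat honors these properties in the same way.

```java
public class SplitSizeSketch {

    // Same arithmetic as FileInputFormat.computeSplitSize in the new MapReduce API.
    static long computeSplitSize(long blockSize, long minSize, long maxSize) {
        return Math.max(minSize, Math.min(maxSize, blockSize));
    }

    // Rough split count for a single file; ignores the small slop Hadoop
    // allows on the final split, so treat the result as an estimate.
    static long estimateSplits(long fileLength, long splitSize) {
        return (fileLength + splitSize - 1) / splitSize;
    }

    public static void main(String[] args) {
        long mib = 1024L * 1024L;
        long blockSize = 128 * mib;        // hypothetical HDFS block size
        long fileLength = 10L * 1024 * mib; // hypothetical 10 GiB input file

        // Defaults: minsize = 1, maxsize = Long.MAX_VALUE, so the block size wins.
        long defaultSplit = computeSplitSize(blockSize, 1L, Long.MAX_VALUE);

        // Raising minsize to 512 MiB forces larger splits, hence fewer mappers.
        long largerSplit = computeSplitSize(blockSize, 512 * mib, Long.MAX_VALUE);

        System.out.println("default splits: " + estimateSplits(fileLength, defaultSplit));
        System.out.println("splits with 512 MiB minsize: " + estimateSplits(fileLength, largerSplit));
    }
}
```

Note that this only makes the splits of large files bigger. With a plain FileInputFormat, every file still yields at least one split, so raising the minimum does not merge many small files the way Hive's combined splits do.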

On Wednesday, May 13, 2015 11:37 PM, Pradeep Gollakota wrote:

Hi All,
I'm writing an MR job to read data using HCatInputFormat. However, the job is generating
too many splits. I don't have this problem when running queries in Hive, since it combines
splits by default.
Is there an equivalent in MR so that I'm not generating thousands of mappers?
