hadoop-mapreduce-user mailing list archives

From Shashidhar Rao <raoshashidhar...@gmail.com>
Subject Merging small files
Date Sat, 19 Jul 2014 17:29:48 GMT
Hi,

Has anybody worked on a retail use case? My production Hadoop cluster's
block size is 256 MB, but the retail invoice data we have to process is
small: each invoice is only about 4 KB. Should we merge the invoices into
one large file, say 1 GB? What is the best practice in this scenario?


Regards
Shashi
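
To put the sizes in the question side by side, here is a rough arithmetic sketch. It assumes the figures quoted above (4 KB per invoice, 256 MB block size, 1 GB merged file) and relies on the fact that every non-empty HDFS file occupies at least one block of its own. The function name and the returned keys are made up for illustration and are not part of any Hadoop API.

```python
KB = 1024
MB = 1024 * KB
GB = 1024 * MB


def small_file_footprint(record_bytes, block_bytes, merged_bytes):
    """Compare storing records as separate files versus one merged file.

    Hypothetical helper: it only does arithmetic on the sizes given in
    the question and does not talk to a cluster.
    """
    # How many records fit in one merged file and in one block.
    records_per_merged_file = merged_bytes // record_bytes
    records_per_block = block_bytes // record_bytes
    # Blocks needed for the merged file (ceiling division).
    blocks_if_merged = -(-merged_bytes // block_bytes)
    # Stored separately, each small record is its own file with its own block.
    blocks_if_separate = records_per_merged_file
    return {
        "records_per_merged_file": records_per_merged_file,
        "records_per_block": records_per_block,
        "blocks_if_merged": blocks_if_merged,
        "blocks_if_separate": blocks_if_separate,
    }


stats = small_file_footprint(4 * KB, 256 * MB, 1 * GB)
for name, value in stats.items():
    print(f"{name}: {value}")
```

With these assumed sizes, one 1 GB merged file holds 262,144 invoices in 4 blocks, whereas the same invoices kept as separate files would mean 262,144 files and as many blocks for the NameNode to track.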
