hadoop-general mailing list archives

From Aaron Myers <...@cloudera.com>
Subject Re: cleaning up 'hadoop.tmp.dir' ?
Date Tue, 09 Nov 2010 16:23:54 GMT
On Tue, Nov 9, 2010 at 6:29 AM, Wojciech Langiewicz
<wlangiewicz@gmail.com> wrote:

>
> What might be causing a situation where I have about 5TB in HDFS and the
> hadoop tmp dirs have about 16TB in total?
>

If this is indeed the block data of your HDFS files, then this makes perfect
sense. By default HDFS keeps three replicas of every block (dfs.replication
is 3), so ~5TB used in HDFS is ~15TB raw on disk.
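
As a rough check of that arithmetic, here is a minimal sketch in Python. The
function names are made up for illustration, and it assumes every file uses
the default replication factor of 3, which may not hold if some files were
written with a different setting.

```python
# Hypothetical helpers, not part of Hadoop: estimate raw disk usage from the
# logical HDFS usage and the replication factor.

def raw_usage_tb(logical_tb, replication=3):
    """Raw disk space consumed by block replicas, in TB."""
    return logical_tb * replication


def unexplained_tb(observed_raw_tb, logical_tb, replication=3):
    """Raw space not accounted for by block replicas, in TB."""
    return observed_raw_tb - raw_usage_tb(logical_tb, replication)


if __name__ == "__main__":
    # Figures from this thread: about 5TB in HDFS, about 16TB on disk.
    print(raw_usage_tb(5))        # expected raw usage from replication alone
    print(unexplained_tb(16, 5))  # remainder, given the rounded figures
```

Both inputs in the thread are approximate, so the remainder only indicates
that replication explains nearly all of the space under hadoop.tmp.dir.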

Aaron
