hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Aman <aman_d...@hotmail.com>
Subject Re: What exactly are the output_dir/part-00000 semantics (of a streaming job) ?
Date Thu, 12 May 2011 16:49:23 GMT
The creation of files part-nnnnn is atomic. When you run a MR job, these
files are created in directory <output_dir>/_temporary and moved to
<output_dir> after the files is closed for writing. This move is atomic
hence as long as you don't try to read these files from temporary directory
(which I see you are not) you will be fine. 

View this message in context: http://lucene.472066.n3.nabble.com/What-exactly-are-the-output-dir-part-00000-semantics-of-a-streaming-job-tp2931125p2932598.html
Sent from the Hadoop lucene-users mailing list archive at Nabble.com.

View raw message