crunch-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ben Juhn <benjij...@gmail.com>
Subject Removing PCollection.cache call resulting in two MR jobs writing to same path
Date Fri, 29 Jul 2016 18:33:49 GMT
I removed a .cache call and am seeing some troublesome behavior.  It results in two nodes in
Crunch's execution graph writing to the same output path.  When I add the .cache call back
I end up with one node writing to crunch tmp space, and the other node writing to the output
path.  

Is this expected behavior?

Thanks,
Ben
Mime
View raw message