crunch-dev mailing list archives

From Ben Juhn
Subject Removing PCollection.cache call resulting in two MR jobs writing to same path
Date Fri, 22 Jul 2016 15:48:04 GMT

I removed a .cache call and am now seeing some troublesome behavior: two nodes in Crunch's
execution graph write to the same output path. When I add the .cache call back, one node
writes to Crunch tmp space and the other node writes to the output path.

Is this expected behavior?

