hadoop-common-user mailing list archives

From "Mafish Liu" <maf...@gmail.com>
Subject Re: Output directory already exists
Date Wed, 03 Sep 2008 01:33:45 GMT
On Wed, Sep 3, 2008 at 1:24 AM, Shirley Cohen <scohen@cs.utexas.edu> wrote:

> Hi,
> I'm trying to write the output of two different map-reduce jobs into the
> same output directory. I'm using MultipleOutputFormats to set the filename
> dynamically, so there is no filename collision between the two jobs.
> However, I'm getting the error "output directory already exists".
> Does the framework support this functionality? It seems silly to have to
> create a temp directory to store the output files from the second job and
> then have to copy them to the first job's output directory after the second
> job completes.

Map/Reduce creates the output directory each time a job runs, and the job
fails if that directory already exists. As far as I can tell, there is no way
to do what you describe other than modifying the source code.
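
For illustration, here is a minimal sketch of the behavior described above and
of the temp-directory workaround Shirley mentions. It is not Hadoop code: it
uses only java.nio.file on a local filesystem as a stand-in for HDFS, and the
names OutputDirSketch, checkOutputDir and mergeInto are hypothetical. On HDFS
the move step would presumably go through FileSystem.rename instead.

```java
import java.io.IOException;
import java.nio.file.FileAlreadyExistsException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

class OutputDirSketch {

    // Mimics the check described in the reply: refuse to start
    // if the output directory is already there.
    static void checkOutputDir(Path outputDir) throws IOException {
        if (Files.exists(outputDir)) {
            throw new FileAlreadyExistsException(
                outputDir.toString(), null, "output directory already exists");
        }
    }

    // Workaround: the second job writes to tempDir, then every entry is
    // moved into the first job's directory. Returns the number of entries moved.
    static int mergeInto(Path tempDir, Path sharedDir) throws IOException {
        // Snapshot the listing first so we do not move entries while iterating.
        List<Path> entries;
        try (Stream<Path> listing = Files.list(tempDir)) {
            entries = listing.collect(Collectors.toList());
        }
        for (Path entry : entries) {
            // No REPLACE_EXISTING option: a filename collision raises
            // FileAlreadyExistsException instead of silently overwriting.
            Files.move(entry, sharedDir.resolve(entry.getFileName()));
        }
        // The temp directory is empty now, so it can be removed.
        Files.delete(tempDir);
        return entries.size();
    }
}
```

Because the two jobs use distinct filenames, the move step should not hit a
collision; if it does, the sketch fails loudly rather than losing output.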

> Thanks,
> Shirley

Institute of Computing Technology, Chinese Academy of Sciences, Beijing.
