Return-Path:
X-Original-To: apmail-crunch-dev-archive@www.apache.org
Delivered-To: apmail-crunch-dev-archive@www.apache.org
Received: from mail.apache.org (hermes.apache.org [140.211.11.3])
	by minotaur.apache.org (Postfix) with SMTP id 70037F404
	for ; Mon, 29 Apr 2013 03:52:22 +0000 (UTC)
Received: (qmail 71938 invoked by uid 500); 29 Apr 2013 03:52:22 -0000
Delivered-To: apmail-crunch-dev-archive@crunch.apache.org
Received: (qmail 71775 invoked by uid 500); 29 Apr 2013 03:52:19 -0000
Mailing-List: contact dev-help@crunch.apache.org; run by ezmlm
Precedence: bulk
List-Help:
List-Unsubscribe:
List-Post:
List-Id:
Reply-To: dev@crunch.apache.org
Delivered-To: mailing list dev@crunch.apache.org
Received: (qmail 71464 invoked by uid 500); 29 Apr 2013 03:52:16 -0000
Delivered-To: apmail-incubator-crunch-dev@incubator.apache.org
Received: (qmail 71440 invoked by uid 99); 29 Apr 2013 03:52:16 -0000
Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28)
	by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 29 Apr 2013 03:52:16 +0000
Date: Mon, 29 Apr 2013 03:52:16 +0000 (UTC)
From: "Josh Wills (JIRA)"
To: crunch-dev@incubator.apache.org
Message-ID:
In-Reply-To:
References:
Subject: [jira] [Commented] (CRUNCH-199) Add Source and Target Support for AvroTrevniKey files
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394

    [ https://issues.apache.org/jira/browse/CRUNCH-199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13644246#comment-13644246 ]

Josh Wills commented on CRUNCH-199:
-----------------------------------

Hey Micah-- the thing that moves the data from the temporary location that is created during the crunch job run to the final location is the MSCROutputHandler in o.a.c.impl.mr.plan. I think the source of the issue is at the end of the configureMapReduce function in the TrevniKeyTarget, where you reset the outputPath. I think that this line should go.

Also, I agree with your comment: the TrevniKeyOutputFormat isn't aware of multiple schema outputs within the context of a single job, so you'll need to use a FormatBundle directly so that different config schemas don't conflict with each other. I'll post a patch to demonstrate.

> Add Source and Target Support for AvroTrevniKey files
> -----------------------------------------------------
>
>                 Key: CRUNCH-199
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-199
>             Project: Crunch
>          Issue Type: Wish
>          Components: Core, IO
>            Reporter: Micah Whitacre
>            Assignee: Josh Wills
>            Priority: Minor
>         Attachments: CRUNCH-199.patch
>
>
> Add Source, Target, and SourceTarget, support for AvroTrevniKeyInputFormat[1] and AvroTrevniKeyOutputFormat[2].
> [1] - http://avro.apache.org/docs/1.7.4/api/java/org/apache/trevni/avro/mapreduce/AvroTrevniKeyInputFormat.html
> [2] - http://avro.apache.org/docs/1.7.4/api/java/org/apache/trevni/avro/mapreduce/AvroTrevniKeyOutputFormat.html

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira