crunch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tom White (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CRUNCH-502) OutputFormat has inconsistent context state in interface functions
Date Tue, 03 Mar 2015 18:43:05 GMT

    [ https://issues.apache.org/jira/browse/CRUNCH-502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345476#comment-14345476
] 

Tom White commented on CRUNCH-502:
----------------------------------

I checked that this doesn't break CRUNCH-481. I did notice that it doesn't compile under Hadoop
1 though - is that still a problem?

> OutputFormat has inconsistent context state in interface functions
> ------------------------------------------------------------------
>
>                 Key: CRUNCH-502
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-502
>             Project: Crunch
>          Issue Type: Bug
>          Components: IO
>    Affects Versions: 0.12.0
>            Reporter: Mārtiņš Kalvāns
>            Assignee: Josh Wills
>         Attachments: CRUNCH-502.patch
>
>
> I created example project to demonstrate problematic behaviour:
> https://github.com/sisidra/crunch-ofb
> 1. FormatBundle config is not populated to Configuration in checkOutputSpecs:
> https://github.com/sisidra/crunch-ofb/blob/master/src/main/java/com/spotify/crunch/bugreport/MyOutputFormat.java#L39
> {code}
> 15/03/02 15:40:24 INFO bugreport.MyOutputFormat: my.config.key (checkOutputSpecs): null
> 15/03/02 15:40:24 ERROR bugreport.MyOutputFormat: Wrong my.config.key value in checkOutputSpecs!
> {code}
> 2. TaskAttemptContext. getTaskAttemptID().toString() is different in getRecordWriter
and getOutputCommitter:
> {code}
> 2015-03-02 15:40:38,960 INFO [main] com.spotify.crunch.bugreport.MyOutputFormat: TaskAttemptID
(getOutputCommitter): attempt_1422406067005_0121_m_000000_0
> ...
> 2015-03-02 15:40:39,789 INFO [main] com.spotify.crunch.bugreport.MyOutputFormat: TaskAttemptID
(getRecordWriter): attempt_1422406067005_out0_0121_m_000000_0
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message