crunch-dev mailing list archives

From "Josh Wills (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (CRUNCH-502) OutputFormat has inconsistent context state in interface functions
Date Tue, 03 Mar 2015 21:17:05 GMT

     [ https://issues.apache.org/jira/browse/CRUNCH-502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Josh Wills updated CRUNCH-502:
------------------------------
    Attachment: CRUNCH-502c.patch

I managed to simplify this and make it less awful, if still a bit odd. I verified that it works
on both hadoop1 and hadoop2, and I will commit it soon.

> OutputFormat has inconsistent context state in interface functions
> ------------------------------------------------------------------
>
>                 Key: CRUNCH-502
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-502
>             Project: Crunch
>          Issue Type: Bug
>          Components: IO
>    Affects Versions: 0.12.0
>            Reporter: Mārtiņš Kalvāns
>            Assignee: Josh Wills
>         Attachments: CRUNCH-502.patch, CRUNCH-502b.patch, CRUNCH-502c.patch
>
>
> I created an example project to demonstrate the problematic behaviour:
> https://github.com/sisidra/crunch-ofb
> 1. The FormatBundle config is not propagated to the Configuration in checkOutputSpecs:
> https://github.com/sisidra/crunch-ofb/blob/master/src/main/java/com/spotify/crunch/bugreport/MyOutputFormat.java#L39
> {code}
> 15/03/02 15:40:24 INFO bugreport.MyOutputFormat: my.config.key (checkOutputSpecs): null
> 15/03/02 15:40:24 ERROR bugreport.MyOutputFormat: Wrong my.config.key value in checkOutputSpecs!
> {code}
> 2. TaskAttemptContext.getTaskAttemptID().toString() is different in getRecordWriter and getOutputCommitter:
> {code}
> 2015-03-02 15:40:38,960 INFO [main] com.spotify.crunch.bugreport.MyOutputFormat: TaskAttemptID (getOutputCommitter): attempt_1422406067005_0121_m_000000_0
> ...
> 2015-03-02 15:40:39,789 INFO [main] com.spotify.crunch.bugreport.MyOutputFormat: TaskAttemptID (getRecordWriter): attempt_1422406067005_out0_0121_m_000000_0
> {code}
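
The mismatch in item 2 can be isolated with a small standalone check. This is only an illustrative sketch: the class name AttemptIdDiff and the helper extraSegments are hypothetical and are not part of Crunch or Hadoop, and the two ID strings are copied verbatim from the log lines quoted above. It splits each ID on underscores and reports which segments appear only in the getRecordWriter value.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Crunch or Hadoop.
public class AttemptIdDiff {

    /**
     * Returns the underscore-separated segments of {@code actual} that are
     * left over after removing one occurrence of each segment of {@code expected}.
     */
    public static List<String> extraSegments(String expected, String actual) {
        List<String> remaining = new ArrayList<>(Arrays.asList(actual.split("_")));
        for (String segment : expected.split("_")) {
            remaining.remove(segment); // removes the first matching segment, if any
        }
        return remaining;
    }

    public static void main(String[] args) {
        // Values taken from the log output in the issue description.
        String fromCommitter = "attempt_1422406067005_0121_m_000000_0";
        String fromWriter = "attempt_1422406067005_out0_0121_m_000000_0";

        System.out.println("IDs equal: " + fromCommitter.equals(fromWriter));
        System.out.println("Extra segments in getRecordWriter ID: "
                + extraSegments(fromCommitter, fromWriter));
    }
}
```

For the two logged values, the only leftover segment is out0, which sits immediately before the 0121 segment in the getRecordWriter ID. An OutputFormat that derives file names or state from the attempt ID string would therefore see different values in the two methods.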



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
