hadoop-mapreduce-issues mailing list archives

From "Jason Lowe (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (MAPREDUCE-6299) bzip2 codec read duplicate rows
Date Mon, 06 Apr 2015 22:10:12 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-6299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jason Lowe reassigned MAPREDUCE-6299:

    Assignee:     (was: Jason Lowe)

Sorry for the delay, as I was out on vacation.  I am missing quite a bit of context for this
JIRA and am not sure why it was assigned to me.  [~hive.bugs] can you provide more context
and/or a reproducible test case?

> bzip2 codec read duplicate rows
> -------------------------------
>                 Key: MAPREDUCE-6299
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6299
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2
>    Affects Versions: 2.4.0
>            Reporter: Keith Ly
>            Priority: Critical
> select count(*) from bzip_table returns a count of 36 rows when bzip_table actually contains
> 18 rows. Create table bzip_table2 as select * from bzip_table results in 36 rows in bzip_table2,
> and so on.

This message was sent by Atlassian JIRA