hadoop-pig-dev mailing list archives

From "Pradeep Kamath (JIRA)" <j...@apache.org>
Subject [jira] Updated: (PIG-814) Make Binstorage more robust when data contains record markers
Date Fri, 22 May 2009 20:30:45 GMT

     [ https://issues.apache.org/jira/browse/PIG-814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Pradeep Kamath updated PIG-814:
-------------------------------

      Resolution: Fixed
    Hadoop Flags: [Reviewed]
          Status: Resolved  (was: Patch Available)

This patch fixes existing code that is already exercised by other tests, so no new tests were included.

Patch committed.

> Make Binstorage more robust when data contains record markers
> -------------------------------------------------------------
>
>                 Key: PIG-814
>                 URL: https://issues.apache.org/jira/browse/PIG-814
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.2.1
>            Reporter: Pradeep Kamath
>            Assignee: Pradeep Kamath
>             Fix For: 0.3.0
>
>         Attachments: PIG-814.patch
>
>
> When the input stream for BinStorage is at a position where the data contains the record marker
> sequence, the code incorrectly assumes that it is at the beginning of a record (tuple) and
> calls DataReaderWriter.readDatum() to try to read the tuple. The problem is more likely when
> RandomSampleLoader (used in the order by implementation) skips ahead in the input stream for
> sampling and then calls BinStorage.getNext(). The code should be more robust in such cases.
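
The failure mode described above can be illustrated with a small, self-contained sketch. This is not the code from PIG-814.patch and does not use Pig's actual on-disk format: the marker bytes, the type tag, the length-prefixed layout, and the helper names (write_record, next_record) are all hypothetical, chosen only to show the idea. A reader that lands at an arbitrary offset scans for the marker, then checks that what follows looks like a record header before decoding; if it does not, it treats the marker as ordinary data and keeps scanning.

```python
import struct

# Hypothetical toy format, NOT Pig's real BinStorage layout:
#   MARKER (3 bytes) | TUPLE_TAG (1 byte) | payload length (4 bytes, big-endian) | payload
MARKER = b"\x01\x02\x03"
TUPLE_TAG = 0x6E
HEADER_LEN = 1 + 4  # tag byte plus length field


def write_record(payload: bytes) -> bytes:
    """Serialize one record in the toy format."""
    return MARKER + bytes([TUPLE_TAG]) + struct.pack(">I", len(payload)) + payload


def next_record(buf: bytes, pos: int):
    """Return (payload, new_pos) for the first plausible record at or after pos.

    Returns (None, len(buf)) when no further record is found.
    """
    while True:
        start = buf.find(MARKER, pos)
        if start < 0:
            return None, len(buf)
        header = start + len(MARKER)
        body = header + HEADER_LEN
        # Only trust the marker if a tuple tag and an in-bounds length follow it.
        if body <= len(buf) and buf[header] == TUPLE_TAG:
            (length,) = struct.unpack_from(">I", buf, header + 1)
            if body + length <= len(buf):
                return buf[body:body + length], body + length
        # The marker bytes were part of some record's data: resume one byte later.
        pos = start + 1


# The first record's payload happens to contain the marker sequence.
data = write_record(b"ab" + MARKER + b"cd") + write_record(b"next")

# Offset 10 is where the embedded marker begins, as if a sampler had skipped there.
payload, pos = next_record(data, 10)
# payload == b"next": the embedded marker is rejected and the real record is found
```

A check like this reduces false positives but cannot eliminate them, since data bytes could also happen to resemble a valid header after the marker.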

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

