hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Gerrit Jansen van Vuuren (JIRA)" <j...@apache.org>
Subject [jira] Created: (PIG-1237) Piggybank MutliStorage - specify field to write in output
Date Fri, 12 Feb 2010 13:44:28 GMT
Piggybank MutliStorage - specify field to write in output
---------------------------------------------------------

                 Key: PIG-1237
                 URL: https://issues.apache.org/jira/browse/PIG-1237
             Project: Pig
          Issue Type: Improvement
            Reporter: Gerrit Jansen van Vuuren
            Assignee: Gerrit Jansen van Vuuren
            Priority: Minor


I've made a modification to the piggy bank MutliStorage class that allows to optionally specify
the index of the field in each tuple to write to output.
This feature allows to have records with metadata like seqno, time of upload etc, and then
to combine files from these records into one but without the metadata.
e.g. 
1: date type seq1 data
2:  date type seq2 data

then write output grouped by type and ordered by sequence:
data
data



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message