pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Joseph Adler (JIRA)" <j...@apache.org>
Subject [jira] [Created] (PIG-2266) bug with input file joining optimization in Pig
Date Tue, 06 Sep 2011 18:27:10 GMT
bug with input file joining optimization in Pig
-----------------------------------------------

                 Key: PIG-2266
                 URL: https://issues.apache.org/jira/browse/PIG-2266
             Project: Pig
          Issue Type: Bug
          Components: impl
    Affects Versions: 0.9.0
            Reporter: Joseph Adler


In src/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/MRCompiler.java, the function
hasTooManyInputFiles instantiated a LoadFunc instance, then calls setLocation before calling
setUDFContextSignature. This is inconsistent with the documentation for the LoadFunc interface
(see http://pig.apache.org/docs/r0.9.0/api/org/apache/pig/LoadFunc.html#setUDFContextSignature(java.lang.String)).
(We've written UDFs that assume that setUDFContextSignature is called first.)

I think you can fix this by adding 

   loader.setUDFContextSignature(ld.getSignature());

Before

   loader.setLocation(location, job);

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message