hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hadoop QA (JIRA)" <j...@apache.org>
Subject [jira] Commented: (PIG-1284) pig UDF is lacking XMLLoader. Plan to add the XMLLoader
Date Fri, 12 Mar 2010 13:04:27 GMT

    [ https://issues.apache.org/jira/browse/PIG-1284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12844487#action_12844487

Hadoop QA commented on PIG-1284:

-1 overall.  Here are the results of testing the latest attachment 
  against trunk revision 922097.

    +1 @author.  The patch does not contain any @author tags.

    +1 tests included.  The patch appears to include 4 new or modified tests.

    +1 javadoc.  The javadoc tool did not generate any warning messages.

    +1 javac.  The applied patch does not increase the total number of javac compiler warnings.

    +1 findbugs.  The patch does not introduce any new Findbugs warnings.

    +1 release audit.  The applied patch does not increase the total number of release audit

    -1 core tests.  The patch failed core unit tests.

    +1 contrib tests.  The patch passed contrib unit tests.

Test results: http://hudson.zones.apache.org/hudson/job/Pig-Patch-h7.grid.sp2.yahoo.net/235/testReport/
Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Pig-Patch-h7.grid.sp2.yahoo.net/235/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Console output: http://hudson.zones.apache.org/hudson/job/Pig-Patch-h7.grid.sp2.yahoo.net/235/console

This message is automatically generated.

> pig UDF is lacking XMLLoader. Plan to add the XMLLoader
> -------------------------------------------------------
>                 Key: PIG-1284
>                 URL: https://issues.apache.org/jira/browse/PIG-1284
>             Project: Pig
>          Issue Type: New Feature
>    Affects Versions: 0.7.0
>            Reporter: Alok Singh
>             Fix For: 0.7.0
>         Attachments: pigudf_xmlLoader.patch, pigudf_xmlLoader.patch
>   Original Estimate: 168h
>  Remaining Estimate: 168h
> Hi All,
>  We are planning to add the XMLLoader UDF in the piggybank repository.
> Here is the proposal with the user docs :-
>  The load function to load the XML file
>  This will implements the LoadFunc interface which is used to parse records
>  from a dataset.
>  This takes a xmlTag as the arg which it will use to split the inputdataset into
>  multiple records.
>  For example if the input xml (input.xml) is like this
>  <configuration>
>  <property>
>  <name> foobar </name>
>  <value> barfoo </value>
>  </property>
>  <ignoreProperty>
>  <name> foo </name>
>  </ignoreProperty>
>  <property>
>  <name> justname </name>
>  </property>
>  </configuration>
>  And your pig script is like this
>  --load the jar files
>  register loader.jar;
>  -- load the dataset using XMLLoader
>  -- A is the bag containing the tuple which contains one atom i.e doc see output
>  A = load '/user/aloks/pig/input.xml using loader.XMLLoader('property') as (doc:chararray);
>  --dump the result
>  dump A;
>  Then you will get the output
> (<property>
> <name> foobar </name>
> <value> barfoo </value>
> </property>)
> (<property>
> <name> justname </name>
> </property>)
> Where each () indicate one record

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message