pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nezih Yigitbasi (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (PIG-3617) problem with temp file deletion in MAPREDUCE operator
Date Mon, 06 Jan 2014 17:55:56 GMT

     [ https://issues.apache.org/jira/browse/PIG-3617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Nezih Yigitbasi updated PIG-3617:
---------------------------------

    Attachment: PIG-3617.patch

Cheolsoo, thanks for the reply. I also think that implementing a DataStorage interface for
HBase is an overkill. So I implemented a simple fix to skip HBase paths during temporary file
deletions in MapReduceLauncher like you proposed. Please review.


> problem with temp file deletion in MAPREDUCE operator
> -----------------------------------------------------
>
>                 Key: PIG-3617
>                 URL: https://issues.apache.org/jira/browse/PIG-3617
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.12.0
>            Reporter: Nezih Yigitbasi
>         Attachments: PIG-3617.patch
>
>
> Hi all,
> When I run a native MR job with the MAPREDUCE keyword and store the intermediate data
in HBase with:
>     stored = MAPREDUCE 'my.jar'
>               STORE x INTO 'hbase://temp_table'
>               USING org.apache.pig.backend.hadoop.hbase.HBaseStorage('hbase_schema')
>               .... and the rest ....;
> Pig tries to delete the temp files, which in this case has an HBase path, and fails with
the exception:
> Caused by: java.lang.IllegalArgumentException: java.net.URISyntaxException: Relative
path in absolute URI: file:hbase:/temp_table
>         at org.apache.hadoop.fs.Path.initialize(Path.java:148)
>         at org.apache.hadoop.fs.Path.<init>(Path.java:126)
>         at org.apache.pig.backend.hadoop.datastorage.HDataStorage.isContainer(HDataStorage.java:197)
>         at org.apache.pig.backend.hadoop.datastorage.HDataStorage.asElement(HDataStorage.java:128)
>         at org.apache.pig.impl.io.FileLocalizer.delete(FileLocalizer.java:415)
>         at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher.launchPig(MapReduceLauncher.java:419)
>         at org.apache.pig.PigServer.launchPlan(PigServer.java:1322)



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Mime
View raw message