pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nezih Yigitbasi (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PIG-3617) problem with temp file deletion in MAPREDUCE operator
Date Thu, 12 Dec 2013 23:17:08 GMT

    [ https://issues.apache.org/jira/browse/PIG-3617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13846891#comment-13846891

Nezih Yigitbasi commented on PIG-3617:

Guys, I plan to implement a solution that checks for an hbase prefix ("hbase://") in the fileSpec
variable (see FileLocalizer.delete), but this method currently uses the DataStorage interfaces
(either local or distributed) kept in PigContext to access the backend storage and as far
as I can see there is no HBase specific implementation for the DataStorage interface. So what
do you guys think is the right way to fix this issue? Is it implementing a DataStorage interface
for HBase or simply use HBaseAdmin.deleteTable to delete the temp table?

> problem with temp file deletion in MAPREDUCE operator
> -----------------------------------------------------
>                 Key: PIG-3617
>                 URL: https://issues.apache.org/jira/browse/PIG-3617
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.12.0
>            Reporter: Nezih Yigitbasi
> Hi all,
> When I run a native MR job with the MAPREDUCE keyword and store the intermediate data
in HBase with:
>     stored = MAPREDUCE 'my.jar'
>               STORE x INTO 'hbase://temp_table'
>               USING org.apache.pig.backend.hadoop.hbase.HBaseStorage('hbase_schema')
>               .... and the rest ....;
> Pig tries to delete the temp files, which in this case has an HBase path, and fails with
the exception:
> Caused by: java.lang.IllegalArgumentException: java.net.URISyntaxException: Relative
path in absolute URI: file:hbase:/temp_table
>         at org.apache.hadoop.fs.Path.initialize(Path.java:148)
>         at org.apache.hadoop.fs.Path.<init>(Path.java:126)
>         at org.apache.pig.backend.hadoop.datastorage.HDataStorage.isContainer(HDataStorage.java:197)
>         at org.apache.pig.backend.hadoop.datastorage.HDataStorage.asElement(HDataStorage.java:128)
>         at org.apache.pig.impl.io.FileLocalizer.delete(FileLocalizer.java:415)
>         at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher.launchPig(MapReduceLauncher.java:419)
>         at org.apache.pig.PigServer.launchPlan(PigServer.java:1322)

This message was sent by Atlassian JIRA

View raw message