pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daniel Dai (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PIG-4912) Tez code does not differentiate between cache archives and files
Date Tue, 23 Aug 2016 00:14:20 GMT

    [ https://issues.apache.org/jira/browse/PIG-4912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15431846#comment-15431846
] 

Daniel Dai commented on PIG-4912:
---------------------------------

We only need to fix on Tez side. MR side is good. We don't need to change JobControlCompiler.addToDistributedCache.
Instead, we shall fix TezResourceManager to handle the extension. I believe in TezResourceManager.getTezResources,
if we create LocalResource into different type as per extension should do the trick.

> Tez code does not differentiate between cache archives and files
> ----------------------------------------------------------------
>
>                 Key: PIG-4912
>                 URL: https://issues.apache.org/jira/browse/PIG-4912
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Rohini Palaniswamy
>            Assignee: Artem Ervits
>             Fix For: 0.17.0
>
>         Attachments: PIG-4912-0.patch, PIG-4912-1.patch
>
>
> Mapreduce code handles archives but Tez code does not.
> {code}
> if (DISTRIBUTED_CACHE_ARCHIVE_MATCHER.reset(uri.toString()).find()) {
>             DistributedCache.addCacheArchive(uri, conf);
>         } else {
>             DistributedCache.addCacheFile(uri, conf);
>         }
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message