[ https://issues.apache.org/jira/browse/PIG-4912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15431846#comment-15431846
]
Daniel Dai commented on PIG-4912:
---------------------------------
We only need to fix on Tez side. MR side is good. We don't need to change JobControlCompiler.addToDistributedCache.
Instead, we shall fix TezResourceManager to handle the extension. I believe in TezResourceManager.getTezResources,
if we create LocalResource into different type as per extension should do the trick.
> Tez code does not differentiate between cache archives and files
> ----------------------------------------------------------------
>
> Key: PIG-4912
> URL: https://issues.apache.org/jira/browse/PIG-4912
> Project: Pig
> Issue Type: Improvement
> Reporter: Rohini Palaniswamy
> Assignee: Artem Ervits
> Fix For: 0.17.0
>
> Attachments: PIG-4912-0.patch, PIG-4912-1.patch
>
>
> Mapreduce code handles archives but Tez code does not.
> {code}
> if (DISTRIBUTED_CACHE_ARCHIVE_MATCHER.reset(uri.toString()).find()) {
> DistributedCache.addCacheArchive(uri, conf);
> } else {
> DistributedCache.addCacheFile(uri, conf);
> }
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
|