hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zheng Shao (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-1100) Make it possible for users to retry map-reduce jobs in a single Hive query
Date Tue, 26 Jan 2010 02:00:34 GMT

    [ https://issues.apache.org/jira/browse/HIVE-1100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12804848#action_12804848

Zheng Shao commented on HIVE-1100:

bq. Is it a timeout based approach? When does the cleanup happen if the client never issues
a clear/continue ?

Actually we also have this problem today - if Hive failed before it cleans up the temporary
data, but with this feature the problem will be much more obvious.
We can add the date and time into "ExecutionId", so that an external process can come in and
clean things up.

bq. Do we need to enhance explain plan or some other tool so that the client knows which tasks
will the query execute ?

That's a good idea. 

8. Add the "done" information for each task to explain plan so that users can know the status
of the query.

> Make it possible for users to retry map-reduce jobs in a single Hive query
> --------------------------------------------------------------------------
>                 Key: HIVE-1100
>                 URL: https://issues.apache.org/jira/browse/HIVE-1100
>             Project: Hadoop Hive
>          Issue Type: New Feature
>    Affects Versions: 0.6.0
>            Reporter: Zheng Shao
>            Assignee: Zheng Shao
> Sometimes a single hive query get compiled into several map-reduce jobs, and one of the
jobs failed because of some transient errors.
> Users would need to start from scratch.
> We should allow the user to start from the point of failure to continue the query.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message