hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Namit Jain (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HIVE-1403) Reporting progress to JT during closing files in FileSinkOperator
Date Tue, 15 Jun 2010 15:33:23 GMT

     [ https://issues.apache.org/jira/browse/HIVE-1403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Namit Jain updated HIVE-1403:
-----------------------------

           Status: Resolved  (was: Patch Available)
     Hadoop Flags: [Reviewed]
    Fix Version/s: 0.6.0
       Resolution: Fixed

Committed. Thanks Ning

> Reporting progress to JT during closing files in FileSinkOperator
> -----------------------------------------------------------------
>
>                 Key: HIVE-1403
>                 URL: https://issues.apache.org/jira/browse/HIVE-1403
>             Project: Hadoop Hive
>          Issue Type: Bug
>            Reporter: Ning Zhang
>            Assignee: Ning Zhang
>             Fix For: 0.6.0
>
>         Attachments: HIVE-1403.patch
>
>
> If there are too many files need to be closed in FileSinkOperator (e.g., if DynamicPartition/FileSpray
is turned on), there could be many files generated by each task and they need to be closed
at the FileSinkOperator.closeOp(). If the NN is overloaded each file close could take more
than 1 sec. This sometimes make JT think the task is dead since it takes too long to close
all the files and without any progress report. We need to report progress after a while during
file closing. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message