hive-issues mailing list archives

From "ASF GitHub Bot (JIRA)" <>
Subject [jira] [Work logged] (HIVE-21292) Break up DDLTask 1 - extract Database related operations
Date Fri, 22 Feb 2019 11:04:03 GMT


ASF GitHub Bot logged work on HIVE-21292:

                Author: ASF GitHub Bot
            Created on: 22/Feb/19 11:03
            Start Date: 22/Feb/19 11:03
    Worklog Time Spent: 10m 
      Work Description: kgyrtkirk commented on pull request #543: HIVE-21292: Break up DDLTask 1 - extract Database related operations

 File path: ql/src/java/org/apache/hadoop/hive/ql/parse/
 @@ -2571,11 +2567,10 @@ private void analyzeDescDatabase(ASTNode ast) throws SemanticException
       throw new SemanticException("Unexpected Tokens at DESCRIBE DATABASE");
-    DescDatabaseDesc descDbDesc = new DescDatabaseDesc(ctx.getResFile(),
-        dbName, isExtended);
+    DescDatabaseDesc descDbDesc = new DescDatabaseDesc(ctx.getResFile(), dbName, isExtended);
     inputs.add(new ReadEntity(getDatabase(dbName)));
-    rootTasks.add(TaskFactory.get(new DDLWork(getInputs(), getOutputs(), descDbDesc)));
-    setFetchTask(createFetchTask(descDbDesc.getSchema()));
+    rootTasks.add(TaskFactory.get(new DDLWork2(getInputs(), getOutputs(), descDbDesc)));
+    setFetchTask(createFetchTask(DESC_DATABASE_SCHEMA));
 Review comment:
   I think the `schema` should work similarly to how it did before. How about asking the DDLWork for the schema? It could look it up based on the desc.
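One way to read the suggestion above, as a minimal sketch only: the work object keeps a lookup from desc type to schema, so the analyzer asks the work for the schema instead of referencing a constant. All names here (`Desc`, `Work`, `DescDatabaseRequest`, `getSchema`) are hypothetical stand-ins rather than the actual Hive classes, and the schema string is a placeholder.

```java
import java.util.HashMap;
import java.util.Map;

final class SchemaLookupSketch {
  /** Stand-in for Hive's DDLDesc; hypothetical, not the real interface. */
  interface Desc {}

  /** Immutable request for DESCRIBE DATABASE (hypothetical shape). */
  static final class DescDatabaseRequest implements Desc {
    final String dbName;
    final boolean extended;

    DescDatabaseRequest(String dbName, boolean extended) {
      this.dbName = dbName;
      this.extended = extended;
    }
  }

  /** Stand-in for DDLWork2: wraps a desc and can report the schema for it. */
  static final class Work {
    // Placeholder schema string, assumed for illustration only.
    private static final Map<Class<? extends Desc>, String> SCHEMAS = new HashMap<>();
    static {
      SCHEMAS.put(DescDatabaseRequest.class, "db_name,comment#string:string");
    }

    private final Desc desc;

    Work(Desc desc) {
      this.desc = desc;
    }

    /** Looks the schema up from the type of the wrapped desc. */
    String getSchema() {
      String schema = SCHEMAS.get(desc.getClass());
      if (schema == null) {
        throw new IllegalStateException(
            "No schema registered for " + desc.getClass().getSimpleName());
      }
      return schema;
    }
  }

  public static void main(String[] args) {
    Work work = new Work(new DescDatabaseRequest("default", false));
    System.out.println(work.getSchema());
  }
}
```

With this shape the caller would pass `work.getSchema()` to whatever builds the fetch task, so the analyzer no longer needs to know which schema constant belongs to which operation.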
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:

Issue Time Tracking

    Worklog Id:     (was: 202545)

> Break up DDLTask 1 - extract Database related operations
> --------------------------------------------------------
>                 Key: HIVE-21292
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>          Components: Hive
>    Affects Versions: 3.1.1
>            Reporter: Miklos Gergely
>            Assignee: Miklos Gergely
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 4.0.0
>         Attachments: HIVE-21292.01.patch, HIVE-21292.02.patch, HIVE-21292.03.patch, HIVE-21292.04.patch,
>          Time Spent: 2h 20m
>  Remaining Estimate: 0h
> DDLTask is a huge class, more than 5000 lines long. The related DDLWork is also a huge class, which has a field for each DDL operation it supports. The goal is to refactor these in order to have everything cut into more manageable classes under the package org.apache.hadoop.hive.ql.exec.ddl:
>  * have a separate class for each operation
>  * have a package for each operation group (database ddl, table ddl, etc), so the number of classes under a package is more manageable
>  * make all the requests (DDLDesc subclasses) immutable
>  * DDLTask should be agnostic to the actual operations
>  * right now let's ignore the issue of having some operations handled by DDLTask which are not actual DDL operations (lock, unlock, desc...)
> In the interim, while there are two DDLTask and DDLWork classes in the code base, the new ones in the new package are called DDLTask2 and DDLWork2, thus avoiding the use of fully qualified class names where both the old and the new classes are in use.
> Step #1: extract all the database related operations from the old DDLTask, and move them under the new package. Also create the new internal framework.
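
The goals listed in the description (one class per operation, immutable requests, a task that is agnostic to the concrete operations) could be sketched roughly as follows. This is an illustration under assumptions, not the framework from the patch: `Desc`, `Operation`, `Task`, `register`, and the string return value of `execute` are all hypothetical names and choices made for this example.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Function;

final class DdlSketch {
  /** Stand-in for DDLDesc: a marker for immutable request objects. */
  interface Desc {}

  /** One implementation per DDL operation. */
  interface Operation {
    String execute();
  }

  /** Immutable request: final fields, no setters. */
  static final class DescDatabaseRequest implements Desc {
    private final String dbName;
    private final boolean extended;

    DescDatabaseRequest(String dbName, boolean extended) {
      this.dbName = dbName;
      this.extended = extended;
    }

    String getDbName() { return dbName; }
    boolean isExtended() { return extended; }
  }

  /** The operation lives in its own class and only sees its own request. */
  static final class DescDatabaseOperation implements Operation {
    private final DescDatabaseRequest desc;

    DescDatabaseOperation(DescDatabaseRequest desc) {
      this.desc = desc;
    }

    @Override
    public String execute() {
      return desc.getDbName() + (desc.isExtended() ? " (extended)" : "");
    }
  }

  /** Agnostic task: it only knows how to find an operation for a request type. */
  static final class Task {
    private final Map<Class<? extends Desc>, Function<Desc, Operation>> registry =
        new HashMap<>();

    <T extends Desc> void register(Class<T> type, Function<T, Operation> factory) {
      // Cast once here so each operation receives its concrete request type.
      registry.put(type, d -> factory.apply(type.cast(d)));
    }

    String execute(Desc desc) {
      Function<Desc, Operation> factory = registry.get(desc.getClass());
      if (factory == null) {
        throw new IllegalArgumentException(
            "Unknown request type: " + desc.getClass().getName());
      }
      return factory.apply(desc).execute();
    }
  }

  public static void main(String[] args) {
    Task task = new Task();
    task.register(DescDatabaseRequest.class, DescDatabaseOperation::new);
    System.out.println(task.execute(new DescDatabaseRequest("default", true)));
  }
}
```

Adding a new operation in this sketch means adding a request class, an operation class, and one `register` call; `Task` itself does not change, which is the sense in which it stays agnostic.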

This message was sent by Atlassian JIRA
