hive-dev mailing list archives

From "Ning Zhang (JIRA)" <>
Subject [jira] Updated: (HIVE-1820) Make Hive database data center aware
Date Wed, 08 Dec 2010 06:55:01 GMT


Ning Zhang updated HIVE-1820:

    Attachment: HIVE-1820.patch

Attaching HIVE-1820.patch for review.

> Make Hive database data center aware
> ------------------------------------
>                 Key: HIVE-1820
>                 URL:
>             Project: Hive
>          Issue Type: New Feature
>            Reporter: Ning Zhang
>            Assignee: Ning Zhang
>         Attachments: HIVE-1820.patch
> In order to support multiple data centers (different DFS, MR clusters) for Hive, it is
> desirable to extend the Hive database to be data center aware.
> Currently a Hive database is a logical concept and has no DFS or MR cluster info associated
> with it. A database has a location property indicating the default warehouse directory, but
> the user cannot specify or change it. In order to make it data center aware, the following
> info needs to be maintained:
> 1) data warehouse root location, which is the default HDFS location for newly created
> tables (default=hive.metastore.warehouse.dir)
> 2) scratch dir, which is the HDFS location where MR intermediate files are created (default=hive.exec.scratch.dir)
> 3) MR job tracker URI that jobs should be submitted to (default=mapred.job.tracker)
> 4) Hadoop (bin) dir ($HADOOP_HOME/bin/hadoop)
> These parameters should be saved as (key, value) pairs in database.parameters, and they
> override the jobconf parameters (so if the default database has no such parameter, the value
> is taken from hive-default.xml or hive-site.xml, as it is now).
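
The lookup order proposed in the description (database parameters first, then the jobconf values loaded from hive-default.xml or hive-site.xml) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the code in HIVE-1820.patch: the function names `resolve` and `effective_conf` are hypothetical, plain dicts stand in for database.parameters and the jobconf, and the host name and paths are made-up example values.

```python
# Hypothetical sketch of the override order described in the issue.
# Plain dicts stand in for database.parameters and the jobconf; none of
# these names come from the actual patch.

def resolve(key, db_parameters, job_conf):
    """Return the database-level value if one is set, else the jobconf value."""
    if key in db_parameters:
        return db_parameters[key]
    return job_conf.get(key)  # None if the key is set nowhere


def effective_conf(db_parameters, job_conf):
    """Build a merged view in which database parameters win over the jobconf."""
    merged = dict(job_conf)       # copy, so the caller's jobconf is untouched
    merged.update(db_parameters)  # database-level entries override
    return merged


# Made-up example values: a jobconf as loaded from hive-site.xml, and a
# database that is pinned to a job tracker in another data center.
job_conf = {
    "mapred.job.tracker": "local",
    "hive.exec.scratch.dir": "/tmp/hive",
}
db_parameters = {
    "mapred.job.tracker": "jt.dc2.example.com:8021",
}

print(resolve("mapred.job.tracker", db_parameters, job_conf))
print(resolve("hive.exec.scratch.dir", db_parameters, job_conf))
```

With these example values, the first lookup is answered by the database parameters and the second falls back to the jobconf, which is the behavior the description asks for when a database defines only some of the settings.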

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
