tajo-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "JaeHwa Jung (JIRA)" <j...@apache.org>
Subject [jira] [Created] (TAJO-433) Improve integration with Hive
Date Thu, 19 Dec 2013 05:29:09 GMT
JaeHwa Jung created TAJO-433:

             Summary: Improve integration with Hive
                 Key: TAJO-433
                 URL: https://issues.apache.org/jira/browse/TAJO-433
             Project: Tajo
          Issue Type: Sub-task
          Components: catalog
    Affects Versions: 0.8-incubating
            Reporter: JaeHwa Jung
            Assignee: JaeHwa Jung
             Fix For: 0.8-incubating

Hi, guys~ 

I wish to discuss about HCatalogStore. current Tajo uses HCatalog to integrate catalog with
hive. So, users must run HiveMetaStoreServer because HCatalog connect HiveMetaStoreServer.

But at this point, it needs to make improvements.
- First is HiveMetaStoreServer. if it can fall down or it performance bottlenecks, it will
be a fatal problem for Tajo.
- Second is HCatalog API. It doesn't provide API for DDL. So, Tajo use HCatalog API and HiveMetaStoreClient

So, I wish to remove HiveMetaStoreServer dependency and integrate catalog with Hive using
HiveMetaStoreClient.  And I wish to use HiveMetaStoreClient API instead of HCatalog API. 
I already checked HiveMetaStoreClient to connect HiveMetaStore without HiveMetaStoreServer.

if HiveMetaStoreClient can't connect HiveMetaStore, Tajo try to connect HiveMetaStoreServer
as backup policy.

What do you think about this suggestion? :)

This message was sent by Atlassian JIRA

View raw message