tajo-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hyunsik Choi (JIRA)" <j...@apache.org>
Subject [jira] [Created] (TAJO-337) Generic StorageHandler to provide common storage methods
Date Wed, 27 Nov 2013 04:21:35 GMT
Hyunsik Choi created TAJO-337:

             Summary: Generic StorageHandler to provide common storage methods
                 Key: TAJO-337
                 URL: https://issues.apache.org/jira/browse/TAJO-337
             Project: Tajo
          Issue Type: Improvement
          Components: catalog, storage
            Reporter: Hyunsik Choi
            Assignee: Hyunsik Choi
             Fix For: 1.0-incubating

Currently, Tajo uses HDFS as a primary storage. But, as a data warehouse system, Tajo should
easily support various data sources.

For this, I propose a generic storage handler interface that provides common storage methods:
* splitting input data
* finding a cluster node which is nearest neighbor to data
* accessing catalog
* creating a table
* removing a table

The above methods are derived from query proecssing mechanism on data sets stored in HDFS.

Later, we can add easily storage handlers for HBase or other data sources.

This message was sent by Atlassian JIRA

View raw message