tajo-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Henry Saputra (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TAJO-337) Generic StorageHandler to provide common storage methods
Date Fri, 29 Nov 2013 06:43:35 GMT

    [ https://issues.apache.org/jira/browse/TAJO-337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13835207#comment-13835207

Henry Saputra commented on TAJO-337:

For this one, were you thinking about want to build Tajo specific storage API or leverage
existing library like Apache Gora?

> Generic StorageHandler to provide common storage methods
> --------------------------------------------------------
>                 Key: TAJO-337
>                 URL: https://issues.apache.org/jira/browse/TAJO-337
>             Project: Tajo
>          Issue Type: Improvement
>          Components: catalog, storage
>            Reporter: Hyunsik Choi
>            Assignee: Hyunsik Choi
>             Fix For: 1.0-incubating
> Currently, Tajo uses HDFS as a primary storage. But, as a data warehouse system, Tajo
should easily support various data sources.
> For this, I propose a generic storage handler interface that provides common storage
> * splitting input data
> * finding a cluster node which is nearest neighbor to data
> * accessing catalog
> * creating a table
> * removing a table
> The above methods are derived from query proecssing mechanism on data sets stored in
> Later, we can add easily storage handlers for HBase or other data sources.

This message was sent by Atlassian JIRA

View raw message