hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ashish Thusoo (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-3601) Hive as a contrib project
Date Tue, 02 Sep 2008 14:11:45 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-3601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12627689#action_12627689
] 

Ashish Thusoo commented on HADOOP-3601:
---------------------------------------

Sorry that was a bit incorrect. What I meant is that we have some untested support for MAPs
in the query layer and what you are likely hitting are the results of that. You can try lifting
the restriction for the integers and see what happens. We will try that internally as well.

Are you creating this table through a DDL. If that is the case then that would not work. We
do not yet have support for the serde that generically serializes and deserializes maps and
lists. We rely on thrift to do that. So you should try it with a thrift table. And for now
such tables can only be created programmatically...


Ashish

> Hive as a contrib project
> -------------------------
>
>                 Key: HADOOP-3601
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3601
>             Project: Hadoop Core
>          Issue Type: Wish
>          Components: contrib/hive
>    Affects Versions: 0.19.0
>         Environment: N/A
>            Reporter: Joydeep Sen Sarma
>            Assignee: Ashish Thusoo
>            Priority: Minor
>             Fix For: 0.19.0
>
>         Attachments: hive.tgz, hive.tgz, hive.tgz, HiveTutorial.pdf
>
>   Original Estimate: 1080h
>  Remaining Estimate: 1080h
>
> Hive is a data warehouse built on top of flat files (stored primarily in HDFS). It includes:
> - Data Organization into Tables with logical and hash partitioning
> - A Metastore to store metadata about Tables/Partitions etc
> - A SQL like query language over object data stored in Tables
> - DDL commands to define and load external data into tables
> Hive's query language is executed using Hadoop map-reduce as the execution engine. Queries
can use either single stage or multi-stage map-reduce. Hive has a native format for tables
- but can handle any data set (for example json/thrift/xml) using an IO library framework.
> Hive uses Antlr for query parsing, Apache JEXL for expression evaluation and may use
Apache Derby as an embedded database for MetaStore. Antlr has a BSD license and should be
compatible with Apache license.
> We are currently thinking of contributing to the 0.17 branch as a contrib project (since
that is the version under which it will get tested internally) - but looking for advice on
the best release path.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message