hive-dev mailing list archives

From "Rui Li (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-7333) Create RDD translator, translating Hive Tables into Spark RDDs
Date Wed, 13 Aug 2014 01:46:12 GMT

    [ https://issues.apache.org/jira/browse/HIVE-7333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14095022#comment-14095022 ]

Rui Li commented on HIVE-7333:
------------------------------

Sorry, I forgot to post these here:

I tested the following kinds of tables on Spark:
- TextFile
- SequenceFile
- ORC
- RCFile
- Parquet
- Avro
- Partitioned table
- Bucketed table
- HBase backed table

For each table, I loaded some data into it and ran a count(*) query. For the partitioned
table, the query was run against only one partition. I verified that all the results are
correct. There is an issue with the HBase-backed table, though; I created HIVE-7467 for it.
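
A minimal sketch of this kind of check, not the actual test scripts: the Python
below only builds HiveQL strings that copy a staging table into one table per
file format and then count the rows of each copy. The table names (src_text,
fmt_test_*) and the partition column in the usage note are hypothetical, and the
STORED AS shorthand for some formats (Avro in particular) depends on the Hive
version, so the DDL may need adjusting.

```python
# File formats from the list above, written as Hive STORED AS keywords.
# Assumption: the Hive version in use accepts each keyword as shorthand.
FORMATS = ["TEXTFILE", "SEQUENCEFILE", "ORC", "RCFILE", "PARQUET", "AVRO"]


def count_check_statements(staging="src_text", prefix="fmt_test_"):
    """Build HiveQL that copies `staging` into one table per format
    (CREATE TABLE ... AS SELECT) and counts the rows of each copy.

    `staging` and `prefix` are hypothetical names used for illustration.
    """
    stmts = []
    for fmt in FORMATS:
        table = prefix + fmt.lower()
        stmts.append(
            f"CREATE TABLE {table} STORED AS {fmt} AS SELECT * FROM {staging}"
        )
        stmts.append(f"SELECT count(*) FROM {table}")
    return stmts


def partition_count_statement(table, column, value):
    """Build a count(*) query restricted to a single partition."""
    return f"SELECT count(*) FROM {table} WHERE {column} = '{value}'"


if __name__ == "__main__":
    for stmt in count_check_statements():
        print(stmt + ";")
```

Each count(*) result would then be compared with the row count of the staging
table. For a partitioned table, something like
partition_count_statement("part_test", "ds", "2014-08-12") restricts the count
to one partition.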

The data size is small and the test query is simple, so please let me know if there are more
tables to be tested or if we need more complicated test cases for specific tables.

> Create RDD translator, translating Hive Tables into Spark RDDs
> --------------------------------------------------------------
>
>                 Key: HIVE-7333
>                 URL: https://issues.apache.org/jira/browse/HIVE-7333
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Xuefu Zhang
>            Assignee: Rui Li
>
> Please refer to the design specification.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
