hive-dev mailing list archives

From Chao Sun <sunc...@apache.org>
Subject Re: Dataset for Hive
Date Thu, 02 Apr 2015 06:56:43 GMT
Hi Xiaohe,

You can try TPC-DS from https://github.com/hortonworks/hive-testbench.
It contains a large number of queries with complex joins.

Chao

On Wed, Apr 1, 2015 at 9:30 PM, xiaohe lan <zombiexcoder@gmail.com> wrote:

> Hi All,
>
> I am new to Hive. I just set up a 5-node Hadoop environment and want to
> try HiveQL.
> Is there a dataset I can download to experiment with HiveQL? The dataset
> should have several tables so I can write some complex joins. About 100 GB
> should be fine.
>
> Thanks,
> Xiaohe
>
