spark-dev mailing list archives

From Dean Wampler
Subject Re: How spark and hive integrate in long term?
Date Fri, 21 Nov 2014 23:12:18 GMT
I can't comment on plans for Spark SQL's support for Hive, but several
companies are porting Hive itself onto Spark:

I'm not sure if they are leveraging the old Shark code base or not, but it
appears to be a fresh effort.


Dean Wampler, Ph.D.
Author: Programming Scala, 2nd Edition (O'Reilly)
Typesafe
@deanwampler

On Fri, Nov 21, 2014 at 2:51 PM, Zhan Zhang wrote:

> Spark's integration with Hive is already a very nice feature, but I am
> wondering what the long-term roadmap for it is. Both projects are improving
> and changing quickly. My current understanding is that the Hive part of
> Spark SQL relies on the Hive metastore and the basic Hive parser, and that
> the Thrift server intercepts Hive queries and executes them with Spark's
> own engine.
> As a result, every Hive release requires significant effort on the Spark
> side to support it.
> For the metastore part, we could possibly replace it with HCatalog. But
> given that other parts, e.g., the metastore and the Thrift server, also
> depend on Hive, HCatalog may not help much.
> Does anyone have any insight or ideas?
> Thanks.
> Zhan Zhang
