hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mich Talebzadeh <mich.talebza...@gmail.com>
Subject Re: Using Spark on Hive with Hive also using Spark as its execution engine
Date Tue, 31 May 2016 20:56:51 GMT
Thanks for that Gopal.

Can LLAP be used as a caching tool for data from Oracle DB or any RDBMS.

In that case does it use JDBC to get the data out from the underlying DB?



Dr Mich Talebzadeh



LinkedIn * https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
<https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>*



http://talebzadehmich.wordpress.com



On 31 May 2016 at 21:48, Gopal Vijayaraghavan <gopalv@apache.org> wrote:

>
> > but this sounds to me (without testing myself) adding caching capability
> >to TEZ to bring it on par with SPARK.
>
> Nope, that was the crux of the earlier email.
>
> "Caching" seems to be catch-all term misused in that comparison.
>
> >> There is a big difference between where LLAP & SparkSQL, which has to do
> >> with access pattern needs.
>
> On another note, LLAP can actually be used inside Spark as well, just use
> LlapContext instead of HiveContext.
>
>
> <
> http://www.slideshare.net/HadoopSummit/llap-subsecond-analytical-queries-i
> n-hive/30>
>
>
> I even have a Postgres FDW for LLAP, which is mostly used for analytics
> web dashboards which are hooked into Hive.
>
> https://github.com/t3rmin4t0r/llap_fdw
>
>
> LLAP can do 200-400ms queries, but Postgres can get to the sub 10ms when
> it comes to slicing-dicing result sets <100k rows.
>
> Cheers,
> Gopal
>
>
>

Mime
View raw message