kylin-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Shaofeng SHI (JIRA)" <j...@apache.org>
Subject [jira] [Closed] (KYLIN-679) Adding Spark Support to Apache Kylin
Date Fri, 02 Feb 2018 02:22:01 GMT

     [ https://issues.apache.org/jira/browse/KYLIN-679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Shaofeng SHI closed KYLIN-679.
------------------------------
       Resolution: Fixed
    Fix Version/s:     (was: Future)
                   v2.0.0

The Spark support has been implemented in 2.0; Close this Jira.

> Adding Spark Support to Apache Kylin
> ------------------------------------
>
>                 Key: KYLIN-679
>                 URL: https://issues.apache.org/jira/browse/KYLIN-679
>             Project: Kylin
>          Issue Type: New Feature
>          Components: General
>            Reporter: Luke Han
>            Priority: Major
>             Fix For: v2.0.0
>
>
> Challenges in current architecture:
> High latency when reading data from Hive 
> --Several hours to fetch data when join big tables
> --Route to SQL-on-Hadoop turned off due to performance issue
> Time-to-Market of data latency
> --Huge IO & Network traffic with MR jobs
> Streaming
> --Streaming process and pre-calculate cubes
> Where Spark could bring benefits to Kylin:
> Integrating with Spark SQL: 
> --Option I: Read data from SparkSQL instead of Hive
> --Option II: Route unsupported queries to SparkSQL
> --Option III: Kylin to be OLAP source of SparkSQL
> Spark Cube Build Engine
> --Efficiency cube generate engine with Spark
> Spark Streaming
> --Leverage SparkStreaming for StreamingOLAP
> HBase?
> --Any idea?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message