From: Andras Nagy <andras.istvan.nagy@gmail.com>
Date: Tue, 25 Jun 2019 11:20:55 +0200
Subject: Re: Re: Kylin streaming questions
To: user@kylin.apache.org

Hi ShaoFeng,

Thanks a lot for the pointer on the lambda mode, yes, that's exactly what
I need :)

Is there perhaps documentation on this? For now, I was trying to get this
working 'empirically' and finally succeeded, but some of my conclusions
may be wrong. This is what I concluded:

- the Hive table must have the same name as the streaming table (the name
given to the data source)
- the cube can't be built from the UI (to build the historic segments from
the data in Hive), but it can be built using the REST API (see the sketch
below)
- the cube build engine must be MapReduce. With Spark as the build engine
I got the exception "Cannot adapt to interface
org.apache.kylin.engine.spark.ISparkOutput"
- endTime must be non-overlapping with the streaming data. When I had an
overlap, the streaming data coming from Kafka did not show up in the
output; I guess this is what you meant by "the segments from Hive will
overwrite the segments from Kafka".

Are these correct conclusions? Is there anything else I should be aware of?
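In case it's useful to others, this is roughly the call I ended up using
to trigger the historical build (a minimal sketch in Python; the host,
credentials, cube name and timestamps are placeholders, and the endpoint
is the generic cube build API from the Kylin REST docs, not anything
lambda-specific):

import base64
import json
import urllib.request

KYLIN_HOST = "http://localhost:7070"  # placeholder Kylin instance
CUBE_NAME = "my_lambda_cube"          # placeholder cube name

# Kylin's REST API uses HTTP Basic auth; ADMIN/KYLIN is the default account.
auth = base64.b64encode(b"ADMIN:KYLIN").decode("ascii")

# Build a historical segment from Hive for an explicit time range.
# Timestamps are epoch milliseconds; per the conclusions above, endTime
# should not overlap the range already covered by streaming segments.
payload = json.dumps({
    "startTime": 1559347200000,  # 2019-06-01 00:00:00 UTC
    "endTime": 1561334400000,    # 2019-06-24 00:00:00 UTC
    "buildType": "BUILD",
}).encode("utf-8")

req = urllib.request.Request(
    url=f"{KYLIN_HOST}/kylin/api/cubes/{CUBE_NAME}/build",
    data=payload,
    method="PUT",
    headers={
        "Authorization": f"Basic {auth}",
        "Content-Type": "application/json",
    },
)
with urllib.request.urlopen(req) as resp:
    print(resp.status, resp.read().decode("utf-8"))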
Many thanks,
Andras


On Tue, Jun 25, 2019 at 9:19 AM ShaoFeng Shi <shaofengshi@apache.org> wrote:

> Hello Andras,
>
> Kylin's realtime-OLAP feature supports a "Lambda" mode (mentioned in
> https://kylin.apache.org/blog/2019/04/12/rt-streaming-design/), which
> means you can define a fact table whose data can come from both Kafka and
> Hive. The only requirement is that all the cube columns appear in both
> the Kafka data and the Hive data. I think that may fit your need. The
> cube can be built from Kafka, and in the meanwhile it can also be built
> from Hive; the segments from Hive will overwrite the segments from Kafka
> (as usually the Hive data is more accurate). When querying the cube,
> Kylin will first query the historical segments, and then the real-time
> segments (adding the max time of the historical segments as a condition).
>
>
> Best regards,
>
> Shaofeng Shi 史少锋
> Apache Kylin PMC
> Email: shaofengshi@apache.org
>
> Apache Kylin FAQ: https://kylin.apache.org/docs/gettingstarted/faq.html
> Join Kylin user mail group: user-subscribe@kylin.apache.org
> Join Kylin dev mail group: dev-subscribe@kylin.apache.org
>
>
> Andras Nagy <andras.istvan.nagy@gmail.com> wrote on Mon, Jun 24, 2019 at
> 11:29 PM:
>
>> Dear Ma,
>>
>> Thanks for your reply.
>>
>> Slightly related to my original question on the hybrid model, I was
>> wondering if it's possible to combine a batch and a streaming cube. I
>> realized this is not possible, as a hybrid model can only be created
>> from cubes of the same model (and a model points to either a batch or a
>> streaming data source).
>>
>> The use case would be this:
>> - we have a large amount of streaming data in Kafka that we would like
>> to process with Kylin streaming
>> - Kafka retention is only a few days, so if we need to change anything
>> in the cubes (e.g. introduce a new metric or dimension which has been
>> present in the events, but not in the cube definition), we can only
>> reprocess a few days' worth of data in the streaming model
>> - the raw events are also written to a data lake for long-term storage
>> - the data written to the data lake could be used to feed the historic
>> data into a batch Kylin model (and cubes)
>> - I'm looking for a way to combine these, so if we want to change
>> anything in the cubes, we can recalculate them for the historic data as
>> well
>>
>> Is there a way to achieve this with current Kylin? (Without implementing
>> a custom query layer that combines the two cubes.)
>>
>> Best regards,
>> Andras
>>
>>
>> On Fri, Jun 14, 2019 at 6:43 AM Ma Gang <mg4work@163.com> wrote:
>>
>>> Hi Andras,
>>>
>>> Currently it doesn't support consuming from specified offsets; it only
>>> supports consuming from the start offset or the latest offset. If you
>>> want to consume from the start offset, you need to set the
>>> configuration kylin.stream.consume.offsets.latest to false in the
>>> cube's overrides page.
>>>
>>> If you do need to start from specified offsets, please create a JIRA
>>> request, but I think it is hard for a user to know what offsets should
>>> be set for all partitions.
>>>
>>> At 2019-06-13 22:34:59, "Andras Nagy" <andras.istvan.nagy@gmail.com>
>>> wrote:
>>>
>>> Dear Ma,
>>>
>>> Thank you very much!
>>>
>>> >1) yes, you can specify a configuration in the new cube, to consume
>>> data from the start offset
>>> That is, an offset value for each partition of the topic? That would be
>>> good - could you please point me to where to do this in practice, or
>>> point me to what I should read? (I haven't found it on the cube
>>> designer UI - perhaps this is something that's only available on the
>>> API?)
>>>
>>> Many thanks,
>>> Andras
>>>
>>>
>>> On Thu, Jun 13, 2019 at 1:14 PM Ma Gang <mg4work@163.com> wrote:
>>>
>>>> Hi Andras,
>>>>
>>>> 1) yes, you can specify a configuration in the new cube, to consume
>>>> data from the start offset
>>>>
>>>> 2) it should work, but I haven't tested it yet
>>>>
>>>> 3) as I remember, we currently use the Kafka 1.0 client library, so it
>>>> is better to use that version or later; I'm sure that versions before
>>>> 0.9.0 cannot work, but I'm not sure whether 0.9.x works or not
>>>>
>>>>
>>>> Ma Gang
>>>> Email: mg4work@163.com
>>>>
>>>> (Signature customized by NetEase Mail Master)
>>>>
>>>> On 06/13/2019 18:01, Andras Nagy wrote:
>>>> Greetings,
>>>>
>>>> I have a few questions related to the new streaming (real-time OLAP)
>>>> implementation.
>>>>
>>>> 1) Is there a way to have data reprocessed from Kafka? E.g. I change a
>>>> cube definition and drop the cube (or add a new cube definition) and
>>>> want the data that is still available on Kafka to be reprocessed to
>>>> build the changed cube (or new cube)? Is this possible?
>>>>
>>>> 2) Does the hybrid model work with streaming cubes (to combine two
>>>> cubes)?
>>>>
>>>> 3) What is the minimum Kafka version required? The tutorial asks to
>>>> install Kafka 1.0 - is this the minimum required version?
>>>>
>>>> Thank you very much,
>>>> Andras
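PS: for anyone finding this thread in the archives - the override Ma
mentioned above goes into the cube's "Configuration Overrides" page in the
cube designer. In the saved cube descriptor JSON it ends up looking roughly
like this (a sketch; only the property name comes from Ma's mail, and the
surrounding field is how overrides appear to be stored in the descriptor):

"override_kylin_properties": {
    "kylin.stream.consume.offsets.latest": "false"
}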