From user-return-3008-archive-asf-public=cust-asf.ponee.io@kylin.apache.org  Sun Feb 18 03:23:26 2018
Return-Path: <user-return-3008-archive-asf-public=cust-asf.ponee.io@kylin.apache.org>
X-Original-To: archive-asf-public@cust-asf.ponee.io
Delivered-To: archive-asf-public@cust-asf.ponee.io
Received: from mail.apache.org (hermes.apache.org [140.211.11.3])
	by mx-eu-01.ponee.io (Postfix) with SMTP id D4D3B180657
	for <archive-asf-public@cust-asf.ponee.io>; Sun, 18 Feb 2018 03:23:24 +0100 (CET)
Received: (qmail 23150 invoked by uid 500); 18 Feb 2018 02:23:23 -0000
Mailing-List: contact user-help@kylin.apache.org; run by ezmlm
Precedence: bulk
List-Help: <mailto:user-help@kylin.apache.org>
List-Unsubscribe: <mailto:user-unsubscribe@kylin.apache.org>
List-Post: <mailto:user@kylin.apache.org>
List-Id: <user.kylin.apache.org>
Reply-To: user@kylin.apache.org
Delivered-To: mailing list user@kylin.apache.org
Received: (qmail 23141 invoked by uid 99); 18 Feb 2018 02:23:23 -0000
Received: from mail-relay.apache.org (HELO mailrelay1-lw-us.apache.org) (207.244.88.152)
    by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 18 Feb 2018 02:23:23 +0000
Received: from mail-pg0-f41.google.com (mail-pg0-f41.google.com [74.125.83.41])
	by mailrelay1-lw-us.apache.org (ASF Mail Server at mailrelay1-lw-us.apache.org) with ESMTPSA id 430B8623
	for <user@kylin.apache.org>; Sun, 18 Feb 2018 02:23:21 +0000 (UTC)
Received: by mail-pg0-f41.google.com with SMTP id f6so4571893pgs.10
        for <user@kylin.apache.org>; Sat, 17 Feb 2018 18:23:21 -0800 (PST)
X-Gm-Message-State: APf1xPB0bnGQOefdoACFPQzmZKUUwFwMe9nwb1lp3tDoBv4/Cv7K5UkI
	emIB2SbvvIajWIQ37xGGLSxOpkNx35+B9negi8c=
X-Google-Smtp-Source: AH8x226rxtrMYxyDG7RUvXmup/4x9nS1LOk7T8X6L+jvtTIl1DzavHMsD0/XoQAlSaGtyIMrATxi8MUapSgyFxW0E6c=
X-Received: by 10.99.55.65 with SMTP id g1mr8902298pgn.284.1518920600353; Sat,
 17 Feb 2018 18:23:20 -0800 (PST)
MIME-Version: 1.0
Received: by 10.100.160.240 with HTTP; Sat, 17 Feb 2018 18:22:39 -0800 (PST)
In-Reply-To: <8527708fb5a54531b067170446db344d@SNP02SEM02.bureau.si.interne>
References: <8527708fb5a54531b067170446db344d@SNP02SEM02.bureau.si.interne>
From: ShaoFeng Shi <shaofengshi@apache.org>
Date: Sun, 18 Feb 2018 10:22:39 +0800
X-Gmail-Original-Message-ID: <CANfpUcuoWjtUNAvMkh026KwgmmN1x212dEziT8i_J3gYbdAYAg@mail.gmail.com>
Message-ID: <CANfpUcuoWjtUNAvMkh026KwgmmN1x212dEziT8i_J3gYbdAYAg@mail.gmail.com>
Subject: Re: usage of Web inteface Kylin an performances
To: user <user@kylin.apache.org>
Content-Type: multipart/alternative; boundary="94eb2c0bef54d383cf0565734239"

--94eb2c0bef54d383cf0565734239
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Hi Jean-luc,

Most of the Kylin developers are on the New Year holiday, so replies may be
delayed. Here are some comments from my side:

1. I presume that the whole .json files are stored, is it right ?
Yes.
2. Do these kinds of tables contain the cube data ?
Yes; cubes are stored in HBase tables with "KYLIN_" as the prefix.
3. So I am wondering if it is the good method
The "compression" attribute in tomcat/conf/server.xml has nothing to do
with the cube build; it only affects HTTP responses from the web
connector. To enable compression for the cube, you need to configure it in
your Hadoop configuration files, such as mapred-site.xml, hbase-site.xml or
kylin/conf/kylin_job_conf.xml.
4. How is it possible to optimize cube size to keep good performance ?
https://kylin.apache.org/docs21/howto/howto_optimize_cubes.html
5. Is it through the ‘rowkeys’ in the advanced settings when you build the
cube ?
Yes, exactly; putting the most frequently filtered column at the leading
position of the rowkey gives better performance.
6. What shall we put exactly in the ‘Rowkeys’ section ?
All dimensions (excluding 'derived' dimensions) need to be on the rowkey.
If you see too many columns in the agg. group, remove some dimensions from
your cube.
7. Are the aggregation groups used for speed of the queries.
The agg. group is used to optimize the dimension combinations. For an
N-dimension cube, by default there are 2^N combinations (we call each one a
cuboid). If you can divide the N dimensions into several groups, the number
of combinations is greatly reduced, so the cube build becomes much easier
and takes much less space. How should you define the agg. groups? Base them
on your business query patterns.
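
To make the 2^N point in answer 7 concrete, here is a minimal sketch in
Python. It is a simplified counting model, not Kylin's actual cuboid
scheduler: it assumes each agg. group contributes every combination of its
own dimensions, that the base cuboid (all dimensions) is always kept, and
it ignores mandatory, hierarchy and joint rules. The three date columns are
the ones named in the question; DIM_A, DIM_B and DIM_C are hypothetical
placeholders.

```python
from itertools import combinations


def subsets(dims):
    """Yield every combination of the given dimensions, including the empty one."""
    for size in range(len(dims) + 1):
        for combo in combinations(dims, size):
            yield frozenset(combo)


def count_cuboids(all_dims, groups=None):
    """Count distinct dimension combinations (cuboids) in this simplified model.

    With no groups, every combination of all_dims is kept: 2^N.
    With groups, only combinations inside each group are kept,
    plus the base cuboid that contains every dimension.
    """
    if groups is None:
        groups = [all_dims]
    cuboids = {frozenset(all_dims)}  # the base cuboid is always present
    for group in groups:
        cuboids.update(subsets(group))
    return len(cuboids)


# Columns from the question, plus hypothetical placeholder dimensions.
TIME_DIMS = ["YEAR_BEG_DT", "QTR_BEG_DT", "MONTH_BEG_DT"]
OTHER_DIMS = ["DIM_A", "DIM_B", "DIM_C"]
ALL_DIMS = TIME_DIMS + OTHER_DIMS

full = count_cuboids(ALL_DIMS)                              # 2^6 = 64
grouped = count_cuboids(ALL_DIMS, [TIME_DIMS, OTHER_DIMS])  # 8 + 8 - 1 + 1 = 16

print("no agg. groups:", full)
print("two agg. groups:", grouped)
```

In this model, splitting six dimensions into two groups of three cuts the
count from 64 to 16. The trade-off is that a query mixing dimensions from
both groups has no exact cuboid and must be answered from the base cuboid,
which is why the groups should follow the real query patterns.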


2018-02-14 1:49 GMT+08:00 BELLIER Jean-luc <jean-luc.bellier@rte-france.com>:

> Hello,
>
>
>
> I have several questions on Kylin, especially about performances and how
> to manage them. I would like to understand precisely how it works to see
> if I can use it in my business context.
>
>
>
> I come from the relational database world, so as far as I understand on
> OLAP, the searches are performed on the values of primary keys in
> dimensions. These subsets are then joined to get the corresponding lines
> on the facts table. As the dimensions tables are much smaller than the
> facts table, the queries run faster
>
>
>
> *1. Questions on performances*
>
> · the raw data are stored in Hive, and the models and structures
> (cubes) are stored in HBase; I presume that the whole .json files are
> stored, is it right ?
>
> · Where are the cube results stores (I mean after a build, a
> refresh or an append action). Is it also in HBase ? I can see in HBase
> tables like "KYLIN_FF46WDAAGH". Do these kinds of tables contain the cube
> data ?
>
> · I noticed that when I build the ‘sample_cube’, the volume of
> data was very important compared to the size of the original files. Is
> there a way to reduce it (I saw a attribute in the
> $KYLIN_HOME/tomcat/conf/server.xml file, called ‘compression’ for the
> connector on port 7070, but I do not know if it is related to the cube
> size). I tried to change this parameter to ‘yes’, but I noticed a huge
> increase of the duration of cube generation. So I am wondering if it is
> the good method.
>
> · How is it possible to optimize cube size to keep good
> performance ?
>
> · In Hive, putting indexes is not recommended. So how Kylin is
> ensuring good performance when querying high volumes of data ? Is it
> through the ‘rowkeys’ in the advanced settings when you build the cube ?
>
> Or is the answer elsewhere ?
>
>
>
> *2. Questions on cube building*
>
> · By the way, the ‘Advanced settings’ step is still unclear for
> me. I tried to build a cube from scratch using the tables provided in the
> sample project. But I do not know very much what to put in this section.
>
> · My goal is to define groups of data on YEAR_BEG_DT,
> QTR_BEG_DT,MONTH_BEG_DT.
>
> · I do not understand very well why the aggregation group
> contains so many columns. I tried to remove as many as possible, but when
> I tried to set up the joins, but some fields were missing so the saving of
> the cube failed.
>
> · What shall we put exactly in the ‘Rowkeys’ section ? I
> understand that this is used to define data encoding (for speed access ?
> ).Am I right ?
>
> · Are the aggregation groups used for speed of the queries. I
> assume it is the case, because it represents the most commonly used
> associations of columns for the cube.
>
>
>
> Thank you in advance for your help.
>
>
>
> Best regards,
>
> Jean-Luc.
>
> "This message is solely intended for the use of the individual or entity
> to which it is addressed and may contain information that is privileged or
> confidential. If you have received this communication by error, please
> notify us immediately by electronic mail, do not disclose it and delete
> the original message."
>


-- 
Best regards,

Shaofeng Shi 史少锋


--94eb2c0bef54d383cf0565734239--