Return-Path: X-Original-To: apmail-kylin-dev-archive@minotaur.apache.org Delivered-To: apmail-kylin-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6D31D182F6 for ; Tue, 1 Dec 2015 08:13:20 +0000 (UTC) Received: (qmail 66663 invoked by uid 500); 1 Dec 2015 08:13:20 -0000 Delivered-To: apmail-kylin-dev-archive@kylin.apache.org Received: (qmail 66612 invoked by uid 500); 1 Dec 2015 08:13:20 -0000 Mailing-List: contact dev-help@kylin.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@kylin.incubator.apache.org Delivered-To: mailing list dev@kylin.incubator.apache.org Received: (qmail 66601 invoked by uid 99); 1 Dec 2015 08:13:20 -0000 Received: from Unknown (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 01 Dec 2015 08:13:20 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id C615FC6B3A for ; Tue, 1 Dec 2015 08:13:19 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 3.991 X-Spam-Level: *** X-Spam-Status: No, score=3.991 tagged_above=-999 required=6.31 tests=[HTML_MESSAGE=3, KAM_LAZY_DOMAIN_SECURITY=1, T_RP_MATCHES_RCVD=-0.01, URIBL_BLOCKED=0.001] autolearn=disabled Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id Jlc2jWAG9M7y for ; Tue, 1 Dec 2015 08:13:07 +0000 (UTC) Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with SMTP id 468732122C for ; Tue, 1 Dec 2015 08:13:05 +0000 (UTC) Received: (qmail 65012 invoked by uid 99); 1 Dec 2015 08:13:04 -0000 Received: from mail-relay.apache.org (HELO mail-relay.apache.org) (140.211.11.15) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 01 Dec 2015 08:13:04 +0000 Received: from mail-io0-f181.google.com (mail-io0-f181.google.com [209.85.223.181]) by mail-relay.apache.org (ASF Mail Server at mail-relay.apache.org) with ESMTPSA id 18E231A00C5 for ; Tue, 1 Dec 2015 08:13:04 +0000 (UTC) Received: by ioir85 with SMTP id r85so1331562ioi.1 for ; Tue, 01 Dec 2015 00:13:03 -0800 (PST) X-Received: by 10.107.12.234 with SMTP id 103mr69790375iom.124.1448957583444; Tue, 01 Dec 2015 00:13:03 -0800 (PST) MIME-Version: 1.0 Received: by 10.107.164.23 with HTTP; Tue, 1 Dec 2015 00:12:43 -0800 (PST) In-Reply-To: References: <565D4A46.3060600@jd.com> <565D4D05.6060607@jd.com> From: hongbin ma Date: Tue, 1 Dec 2015 16:12:43 +0800 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: How to support Avro Complex Type on Kylin To: dev Content-Type: multipart/alternative; boundary=001a113f8fca0e699f0525d1bafa --001a113f8fca0e699f0525d1bafa Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable =E2=80=8B@shaofeng, it looks like a nice feature.=E2=80=8B On Tue, Dec 1, 2015 at 4:07 PM, Shi, Shaofeng wrote: > Kylin should automatically skip these complex columns, instead of blockin= g > user from import the table, what do you think? > > On 12/1/15, 3:32 PM, "Xiaoyu Wang" wrote: > > >Yes You can create a hive view to remove the datatype array,map column. > > > >=E5=9C=A8 2015=E5=B9=B412=E6=9C=8801=E6=97=A5 15:26, Yiming Liu =E5=86= =99=E9=81=93: > >> Thanks Xiaoyu, for the quick response. > >> > >> > >> Currently, there is no way to remove those fields. The error happens o= n > >>the first step "Sync Hive tables" when designing cube. > >> > >> > >> I will redesign my original tables to fit the datatype requirement. > >> > >> > >> ------------------ Original ------------------ > >> From: "Xiaoyu Wang";; > >> Date: Tue, Dec 1, 2015 03:20 PM > >> To: "dev"; > >> > >> Subject: Re: How to support Avro Complex Type on Kylin > >> > >> > >> > >> Kylin does not support datatype like "array" "map". > >> Can't set the array,map datatype column as dimension. > >> You can remove the array,map column from cube design, and retry . > >> > >> =E5=9C=A8 2015=E5=B9=B412=E6=9C=8801=E6=97=A5 15:05, Yiming Liu =E5=86= =99=E9=81=93: > >>> Hi Kylin expert, > >>> > >>> I have a table with avro encoding. It has map, array field type. I > >>>could query the table on Hive. > >>> > >>> When I sync the table into Kylin, the Kylin says: > >>> "bad data type -- array<string>, does not match > >>>(any|char|varchar|boolean|binary|integer|tinyint|smallint|bigint|decim= al > >>>|numeric|float|real|double|date|time|datetime|timestamp|byte|int|short= |l > >>>ong|string|hllc|_literal_type|_derived_type)\s*(?:[(]([\d\s,]+)[)])?" > >>> > >>> So it seems Kylin does not support the avro complex type, is it right= ? > >>>Do you have any suggestion how to process the complex data type. > >>> > >>> SerDe Library: org.apache.hadoop.hive.serde2.avro.AvroSerDe > >>> > >>>InputFormat: org.apache.hadoop.hive.ql.io.avro.AvroContainerInputForma= t > >>> > >>>OutputFormat: > org.apache.hadoop.hive.ql.io.avro.AvroContainerOutputForma > >>>t > >>> > >>> Following is my table schema: > >>> 0 sessionid string > >>> 1 userid string > >>> 2 hosts array > >>> 3 domain string > >>> 4 visittimes int > >>> 5 firsttimestamp bigint > >>> 6 lasttimestamp bigint > >>> 7 sessiontimestamp bigint > >>> 8 useragent map > >>> 9 srcaddrunsignedint bigint > >>> 10 srcaddrstr string > >>> 11 srcaddrcity map > >>> 12 srcaddrlocation map > >>> 13 destaddrunsignedint bigint > >>> 14 destaddrstr string > >>> 15 destaddrcity map > >>> 16 destaddrlocation map > >>> 17 keywords map> > >>> 18 topics map > >>> 19 cookies map > >>> 20 urls array > >>> 21 year int > >>> 22 month int > >>> 23 day int > >>> 24 hour int > > --=20 Regards, *Bin Mahone | =E9=A9=AC=E6=B4=AA=E5=AE=BE* Apache Kylin: http://kylin.io Github: https://github.com/binmahone --001a113f8fca0e699f0525d1bafa--