kylin-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From " Kaige Liu (JIRA)" <>
Subject [jira] [Updated] (KYLIN-3044) Support SQL Server as data source
Date Wed, 22 Nov 2017 13:32:00 GMT


Kaige Liu updated KYLIN-3044:
    Attachment: KYLIN-3044-sqlserver-as-datasource.patch

Sqoop splits data to a couple of parts and import them parallel. I add a property kylin.source.jdbc.sqoop-mapper-num
to specify how many splits should be divided. Sqoop would run a mapper for each split.
To make each mapper gets even input, split column is chosen following some rules:
1. Prefer ClusteredBy column
2. Prefer DistributedBy column
3. Prefer Partition date column
4. Prefer Higher cardinality column
5. Prefer numeric column
6. Pick a column at first glance

Patch updated.

> Support SQL Server as data source
> ---------------------------------
>                 Key: KYLIN-3044
>                 URL:
>             Project: Kylin
>          Issue Type: Task
>            Reporter:  Kaige Liu
>            Assignee:  Kaige Liu
>         Attachments: KYLIN-3044-sqlserver-as-datasource.patch, KYLIN-3044-sqlserver-as-datasource.patch
> [KYLIN-1351|] has added Vertica as data
source. Base on the work of KYLIN-1351, I'd like to enable SQL Server as data source of kylin.

This message was sent by Atlassian JIRA

View raw message