Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id AD803200CD7 for ; Tue, 1 Aug 2017 09:15:02 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id ABA6D16696A; Tue, 1 Aug 2017 07:15:02 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id F309C16696C for ; Tue, 1 Aug 2017 09:15:01 +0200 (CEST) Received: (qmail 58428 invoked by uid 500); 1 Aug 2017 07:15:01 -0000 Mailing-List: contact issues-help@carbondata.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@carbondata.apache.org Delivered-To: mailing list issues@carbondata.apache.org Received: (qmail 58419 invoked by uid 99); 1 Aug 2017 07:15:01 -0000 Received: from git1-us-west.apache.org (HELO git1-us-west.apache.org) (140.211.11.23) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 01 Aug 2017 07:15:01 +0000 Received: by git1-us-west.apache.org (ASF Mail Server at git1-us-west.apache.org, from userid 33) id E11D6E112D; Tue, 1 Aug 2017 07:15:00 +0000 (UTC) From: zzcclp To: issues@carbondata.apache.org Reply-To: issues@carbondata.apache.org References: In-Reply-To: Subject: [GitHub] carbondata pull request #1205: [CARBONDATA-1086] updated configuration-param... Content-Type: text/plain Message-Id: <20170801071500.E11D6E112D@git1-us-west.apache.org> Date: Tue, 1 Aug 2017 07:15:00 +0000 (UTC) archived-at: Tue, 01 Aug 2017 07:15:02 -0000 Github user zzcclp commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1205#discussion_r130534145 --- Diff: docs/dml-operation-on-carbondata.md --- @@ -149,6 +149,50 @@ You can use the following options to load data: * If this option is set to TRUE, then high.cardinality.identify.enable property will be disabled during data load. +- **SORT_SCOPE:** This property can have four possible values : + + * BATCH_SORT : The sorting scope is smaller and more index tree will be created,thus loading is faster but query maybe slower. + + * LOCAL_SORT : The sorting scope is bigger and one index tree per data node will be created, thus loading is slower but query is faster. + + * GLOBAL_SORT : The sorting scope is bigger and one index tree per task will be created, thus loading is slower but query is faster. + + * NO_SORT : Feasible if we want to load our data in unsorted manner. + + For BATCH_SORT: + + ``` + OPTIONS ('SORT_SCOPE'='BATCH_SORT') + ``` + + You can also specify the sort size option for sort scope. + + ``` + OPTIONS('SORT_SCOPE'='BATCH_SORT', 'batch_sort_size_inmb'='7') + ``` + + Note : + + * batch_sort_size_inmb : Size of data in MB to be processed in batch. By default it is the 45 percent size of sort.inmemory.size.inmb(Memory size in MB available for in-memory sort). + + For GLOBAL_SORT : --- End diff -- Suggestion: add below note: `'SINGLE_PASS' must be false.` --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastructure@apache.org or file a JIRA ticket with INFRA. ---