Mailing-List: contact issues-help@drill.apache.org; run by ezmlm
Precedence: bulk
Reply-To: dev@drill.apache.org
Date: Mon, 22 Jun 2015 19:07:00 +0000 (UTC)
From: "Steven Phillips (JIRA)" <jira@apache.org>
To: issues@drill.apache.org
Message-ID: <JIRA.12839613.1434994326000.139179.1435000020356@Atlassian.JIRA>
In-Reply-To: <JIRA.12839613.1434994326000@Atlassian.JIRA>
References: <JIRA.12839613.1434994326000@Atlassian.JIRA>
 <JIRA.12839613.1434994326721@arcas>
Subject: [jira] [Updated] (DRILL-3333) Add support for auto-partitioning in
 parquet writer
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit


     [ https://issues.apache.org/jira/browse/DRILL-3333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Steven Phillips updated DRILL-3333:
-----------------------------------
    Attachment: DRILL-3333.patch

> Add support for auto-partitioning in parquet writer
> ---------------------------------------------------
>
>                 Key: DRILL-3333
>                 URL: https://issues.apache.org/jira/browse/DRILL-3333
>             Project: Apache Drill
>          Issue Type: Bug
>            Reporter: Steven Phillips
>            Assignee: Aman Sinha
>         Attachments: DRILL-3333.patch, DRILL-3333.patch
>
>
> When a table is created with a partition by clause, the parquet writer will create separate files for the different partition values. The data will first be sorted by the partition keys, and the parquet writer will create new file when it encounters a new value for the partition columns.
> When data is queried against the data that was created this way, partition pruning will work if the filter contains a partition column. And unlike directory based partitioning, no view is required, nor is it necessary to reference the dir* column names.


--
This message was sent by Atlassian JIRA
(v6.3.4#6332)