Return-Path: X-Original-To: apmail-drill-dev-archive@www.apache.org Delivered-To: apmail-drill-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0240518CB1 for ; Mon, 22 Jun 2015 17:33:01 +0000 (UTC) Received: (qmail 75744 invoked by uid 500); 22 Jun 2015 17:33:00 -0000 Delivered-To: apmail-drill-dev-archive@drill.apache.org Received: (qmail 75680 invoked by uid 500); 22 Jun 2015 17:33:00 -0000 Mailing-List: contact dev-help@drill.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@drill.apache.org Delivered-To: mailing list dev@drill.apache.org Received: (qmail 75654 invoked by uid 99); 22 Jun 2015 17:33:00 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 22 Jun 2015 17:33:00 +0000 Date: Mon, 22 Jun 2015 17:33:00 +0000 (UTC) From: "Steven Phillips (JIRA)" To: dev@drill.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (DRILL-3333) Add support for auto-partitioning in parquet writer MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Steven Phillips created DRILL-3333: -------------------------------------- Summary: Add support for auto-partitioning in parquet writer Key: DRILL-3333 URL: https://issues.apache.org/jira/browse/DRILL-3333 Project: Apache Drill Issue Type: Bug Reporter: Steven Phillips When a table is created with a partition by clause, the parquet writer will create separate files for the different partition values. The data will first be sorted by the partition keys, and the parquet writer will create new file when it encounters a new value for the partition columns. When data is queried against the data that was created this way, partition pruning will work if the filter contains a partition column. And unlike directory based partitioning, no view is required, nor is it necessary to reference the dir* column names. -- This message was sent by Atlassian JIRA (v6.3.4#6332)