Mailing-List: contact issues-help@drill.apache.org; run by ezmlm
Precedence: bulk
Reply-To: dev@drill.apache.org
Date: Sat, 28 Feb 2015 02:59:04 +0000 (UTC)
From: "Aman Sinha (JIRA)" <jira@apache.org>
To: issues@drill.apache.org
Message-ID: <JIRA.12778426.1425089547000.40446.1425092344428@Atlassian.JIRA>
In-Reply-To: <JIRA.12778426.1425089547000@Atlassian.JIRA>
References: <JIRA.12778426.1425089547000@Atlassian.JIRA>
 <JIRA.12778426.1425089547102@arcas>
Subject: [jira] [Commented] (DRILL-2347) Parallelize metadata reads for
 Parquet metadata
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit


    [ https://issues.apache.org/jira/browse/DRILL-2347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14341266#comment-14341266 ] 

Aman Sinha commented on DRILL-2347:
-----------------------------------

I have  not run all regression tests yet; only unit tests.  Submitting for regression tests. 

> Parallelize metadata reads for Parquet metadata
> -----------------------------------------------
>
>                 Key: DRILL-2347
>                 URL: https://issues.apache.org/jira/browse/DRILL-2347
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Metadata
>    Affects Versions: 0.7.0
>            Reporter: Aman Sinha
>            Assignee: Jacques Nadeau
>             Fix For: 0.8.0
>
>         Attachments: 0001-DRILL-2347-Support-parquet-metadata-read-paralleliza.patch
>
>
> During the planning process, reading the metadata from Parquet files is sequential, making the planning slow for large number of files.  This should be parallelized. 


--
This message was sent by Atlassian JIRA
(v6.3.4#6332)