drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rahul Challapalli (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-1115) parquet writer never finishes when the source contains huge number of small files
Date Tue, 08 Jul 2014 00:25:33 GMT
Rahul Challapalli created DRILL-1115:
----------------------------------------

             Summary: parquet writer never finishes when the source contains huge number of
small files
                 Key: DRILL-1115
                 URL: https://issues.apache.org/jira/browse/DRILL-1115
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Writer
            Reporter: Rahul Challapalli


git.commit.id.abbrev=790a2ad
Build # 26246

When we try to use 'create table as.....' and the source folder contains around 5000 text
files, drill never completes. I left the query to tun overnight, but it still didn't complete.
However I see nonstop activity in the log files which suggests drill is actually doing something.

Cluster Size : 2
Each file contains only a single number.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message