drill-issues mailing list archives

From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DRILL-3635) IllegalArgumentException - not a Parquet file (too small)
Date Mon, 17 Aug 2015 16:11:46 GMT

    [ https://issues.apache.org/jira/browse/DRILL-3635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14699755#comment-14699755 ]

ASF GitHub Bot commented on DRILL-3635:

GitHub user adeneche opened a pull request:


    DRILL-3635: IllegalArgumentException - not a Parquet file (too small)

    Make sure we close the writer before we return OK_NEW_SCHEMA. WriterRecordBatch
    returns only a single batch, and on the client side DrillResultSetImpl.execute()
    waits for the first batch to arrive before returning, so closing the writer first
    ensures that execute() won't return until all files have been flushed to disk.
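
The ordering described above can be illustrated with a minimal Java sketch. It is not Drill's code: WriterOrderingSketch, BufferingWriter, SingleBatchWriterOperator and Outcome are hypothetical stand-ins, and the "disk" is just an in-memory list. It only shows why closing the writer before signalling the first (and only) batch means a caller that waits for that batch sees fully flushed output.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-ins for illustration; none of these are Drill's real classes.
class WriterOrderingSketch {

    enum Outcome { OK_NEW_SCHEMA, NONE }

    /** Buffers rows in memory and only persists them when close() is called. */
    static class BufferingWriter {
        private final List<String> buffer = new ArrayList<>();
        private final List<String> disk; // simulated file contents
        private boolean closed;

        BufferingWriter(List<String> disk) {
            this.disk = disk;
        }

        void write(String row) {
            buffer.add(row);
        }

        void close() {
            if (!closed) {
                disk.addAll(buffer); // the "flush to disk" step
                buffer.clear();
                closed = true;
            }
        }
    }

    /** Emits exactly one batch; the writer is closed before that batch is signalled. */
    static class SingleBatchWriterOperator {
        private final BufferingWriter writer;
        private final List<String> input;
        private boolean done;

        SingleBatchWriterOperator(BufferingWriter writer, List<String> input) {
            this.writer = writer;
            this.input = input;
        }

        Outcome next() {
            if (done) {
                return Outcome.NONE;
            }
            for (String row : input) {
                writer.write(row);
            }
            writer.close(); // close first, so the data is on "disk" ...
            done = true;
            return Outcome.OK_NEW_SCHEMA; // ... before the caller is told a batch exists
        }
    }

    public static void main(String[] args) {
        List<String> disk = new ArrayList<>();
        SingleBatchWriterOperator op = new SingleBatchWriterOperator(
                new BufferingWriter(disk), Arrays.asList("a", "b", "c"));
        Outcome first = op.next();
        System.out.println(first + ", rows on disk: " + disk.size());
    }
}
```

If close() were instead deferred until after next() had returned, a caller acting on the first batch could read the output while it was still empty, which matches the sporadic failure reported in the issue.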

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/adeneche/incubator-drill DRILL-3635

Alternatively you can review and apply these changes as the patch at:


To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #117

commit 198e9868880303efe470da8c175b3b696b75b76a
Author: adeneche <adeneche@gmail.com>
Date:   2015-08-13T20:04:59Z

    DRILL-3635: IllegalArgumentException - not a Parquet file (too small)


> IllegalArgumentException - not a Parquet file (too small)
> ---------------------------------------------------------
>                 Key: DRILL-3635
>                 URL: https://issues.apache.org/jira/browse/DRILL-3635
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Execution - Flow, Storage - Parquet
>    Affects Versions: 1.1.0
>         Environment: Test framework
>            Reporter: Chris Westin
>            Assignee: Deneche A. Hakim
>             Fix For: 1.2.0
> The (MapR internal) regression suite is sporadically seeing this error:
> /root/private-sql-hadoop-test/framework/resources/Precommit/Functional/ctas_flatten/100000rows/filter4.q
> Query: 
> select * from dfs.ctas_flatten.`filter4_100000rows_ctas`
> Failed with exception
> java.sql.SQLException: SYSTEM ERROR: IllegalArgumentException: maprfs:///drill/testdata/ctas_flatten/filter4_100000rows_ctas/0_0_0.parquet is not a Parquet file (too small)
> [Error Id: 9749d6a7-685d-4663-9b27-1a456a5dec40 on drillats3.qa.lab:31010]
> 	at org.apache.drill.jdbc.impl.DrillCursor.nextRowInternally(DrillCursor.java:244)
> 	at org.apache.drill.jdbc.impl.DrillCursor.loadInitialSchema(DrillCursor.java:287)
> 	at org.apache.drill.jdbc.impl.DrillResultSetImpl.execute(DrillResultSetImpl.java:1362)
> 	at org.apache.drill.jdbc.impl.DrillResultSetImpl.execute(DrillResultSetImpl.java:72)
> 	at net.hydromatic.avatica.AvaticaConnection.executeQueryInternal(AvaticaConnection.java:404)
> 	at net.hydromatic.avatica.AvaticaStatement.executeQueryInternal(AvaticaStatement.java:351)
> 	at net.hydromatic.avatica.AvaticaStatement.executeQuery(AvaticaStatement.java:78)
> 	at org.apache.drill.jdbc.impl.DrillStatementImpl.executeQuery(DrillStatementImpl.java:96)
> 	at org.apache.drill.test.framework.DrillTestJdbc.executeQuery(DrillTestJdbc.java:144)
> 	at org.apache.drill.test.framework.DrillTestJdbc.run(DrillTestJdbc.java:83)
> 	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
> 	at java.util.concurrent.FutureTask.run(FutureTask.java:262)
> 	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> 	at java.lang.Thread.run(Thread.java:744)
> Caused by: org.apache.drill.common.exceptions.UserRemoteException: SYSTEM ERROR: IllegalArgumentException: maprfs:///drill/testdata/ctas_flatten/filter4_100000rows_ctas/0_0_0.parquet is not a Parquet file (too small)
> It doesn't happen every time, but based on looking at log files, it seems to happen more than half the time.
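
For context on the "too small" message: a Parquet file starts and ends with the 4-byte magic "PAR1", and a 4-byte footer length sits just before the trailing magic, so no valid file can be shorter than 12 bytes. The Java sketch below shows that length check under those assumptions. ParquetSizeCheckSketch and its methods are hypothetical names for illustration, not the reader code that Drill actually calls.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper for illustration; not the actual Parquet or Drill reader code.
class ParquetSizeCheckSketch {

    // Layout per the Parquet file format: magic, data, footer, footer length, magic.
    static final byte[] MAGIC = "PAR1".getBytes(StandardCharsets.US_ASCII);
    static final int FOOTER_LENGTH_SIZE = 4;
    static final int MIN_LENGTH = MAGIC.length + FOOTER_LENGTH_SIZE + MAGIC.length;

    /** True if a file of this length cannot hold both magics and the footer length. */
    static boolean isTooSmall(long fileLength) {
        return fileLength < MIN_LENGTH;
    }

    /** Throws if the given length is below the minimum possible Parquet file size. */
    static void requireMinimumLength(String path, long fileLength) {
        if (isTooSmall(fileLength)) {
            throw new IllegalArgumentException(path + " is not a Parquet file (too small)");
        }
    }

    public static void main(String[] args) throws IOException {
        // An empty file stands in for output whose writer has not been closed yet.
        Path empty = Files.createTempFile("0_0_0", ".parquet");
        try {
            requireMinimumLength(empty.toString(), Files.size(empty));
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        } finally {
            Files.delete(empty);
        }
    }
}
```

A file that has been created but not yet flushed and closed has no footer or trailing magic, so it fails this kind of check even though the write would eventually have succeeded.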

This message was sent by Atlassian JIRA
