drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jacques Nadeau (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (DRILL-2483) Configuration parameter to change default record batch size for scanners
Date Fri, 20 Mar 2015 20:55:38 GMT

     [ https://issues.apache.org/jira/browse/DRILL-2483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jacques Nadeau updated DRILL-2483:
----------------------------------
    Summary: Configuration parameter to change default record batch size for scanners  (was:
Make buffer that rows are read into during execution configurable for testing purposes)

> Configuration parameter to change default record batch size for scanners
> ------------------------------------------------------------------------
>
>                 Key: DRILL-2483
>                 URL: https://issues.apache.org/jira/browse/DRILL-2483
>             Project: Apache Drill
>          Issue Type: Wish
>          Components: Storage - Other
>            Reporter: Victoria Markman
>             Fix For: 0.9.0
>
>
> We've found a bug recently where if table had multiple duplicate rows and duplicate rows
span multiple buffers, merge join returned wrong result. Test case had a table with 10,000
rows.
> The same problem could be reproduced on a much smaller data set if buffer size was configurable.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message