Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id A638F200D1E for ; Wed, 4 Oct 2017 07:00:10 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id A4FD3160BD5; Wed, 4 Oct 2017 05:00:10 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id E9D431609DE for ; Wed, 4 Oct 2017 07:00:09 +0200 (CEST) Received: (qmail 33164 invoked by uid 500); 4 Oct 2017 05:00:09 -0000 Mailing-List: contact issues-help@drill.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@drill.apache.org Delivered-To: mailing list issues@drill.apache.org Received: (qmail 33155 invoked by uid 99); 4 Oct 2017 05:00:08 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 04 Oct 2017 05:00:08 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id E89DD1A1411 for ; Wed, 4 Oct 2017 05:00:07 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -99.502 X-Spam-Level: X-Spam-Status: No, score=-99.502 tagged_above=-999 required=6.31 tests=[KAM_NUMSUBJECT=0.5, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id iToxK0gO1A3R for ; Wed, 4 Oct 2017 05:00:05 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id 023995F29A for ; Wed, 4 Oct 2017 05:00:05 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 8B692E0D22 for ; Wed, 4 Oct 2017 05:00:03 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id B26CE242DD for ; Wed, 4 Oct 2017 05:00:01 +0000 (UTC) Date: Wed, 4 Oct 2017 05:00:01 +0000 (UTC) From: "ASF GitHub Bot (JIRA)" To: issues@drill.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (DRILL-5830) Resolve regressions to MapR DB from DRILL-5546 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Wed, 04 Oct 2017 05:00:10 -0000 [ https://issues.apache.org/jira/browse/DRILL-5830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16190817#comment-16190817 ] ASF GitHub Bot commented on DRILL-5830: --------------------------------------- Github user paul-rogers commented on the issue: https://github.com/apache/drill/pull/968 Thanks for the explanation! Bugs in the code are always a concern, but I personally prefer to fix them rather than write more code that tries to work around potential, unknown bugs in other code. (Working around known bugs is, of course, part of the game.) Sounds like we'll need to test the MapR-DB case for the conditions you identified; that will tell is if the HBase solution (which seems to work now) can also be applied to MapR DB. > Resolve regressions to MapR DB from DRILL-5546 > ---------------------------------------------- > > Key: DRILL-5830 > URL: https://issues.apache.org/jira/browse/DRILL-5830 > Project: Apache Drill > Issue Type: Bug > Affects Versions: 1.12.0 > Reporter: Paul Rogers > Assignee: Paul Rogers > Fix For: 1.12.0 > > > DRILL-5546 added a number of fixes for empty batches. One part of the fix was for HBase. Key changes: > * Add code to expand wildcards in the planner. (i.e. SELECT *) > * Remove support for wildcards in the HBase record reader. > As noted in DRILL-5775, this change had the effect of breaking support for MapR-DB binary (which is API compatible with HBase.) DRILL-5775 does this by expanding wildcards in the planner for MapR DB as was done for HBase in DRILL-5546. > Unfortunately, this change introduced other regressions into the code as described by DRILL-5706. > Investigation of those issues revealed that we should back out the original DRILL-5546 changes and go down a different route. > As it turns out, HBase already had a project push-down rule that expanded wildcards. However, that rule didn't work correctly some of the time. DRILL-5546 fixed that bug, ensuring that wildcards are expanded (at least in the cases tested for this ticket.) > The actual issue turned out to be a bug in the {{RecordBatchLoader}} class which did not consider map contents when detecting schema change. As a result, results like (row_key, cf\{}) were treated the same as (row_key, cf\{mycol}) and the actual data colums were discarded, but randomly depending on batch arrival order. -- This message was sent by Atlassian JIRA (v6.4.14#64029)