Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 33884200B72 for ; Fri, 26 Aug 2016 08:29:22 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 320FD160AC3; Fri, 26 Aug 2016 06:29:22 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 81E1A160AB0 for ; Fri, 26 Aug 2016 08:29:21 +0200 (CEST) Received: (qmail 91541 invoked by uid 500); 26 Aug 2016 06:29:20 -0000 Mailing-List: contact issues-help@drill.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@drill.apache.org Delivered-To: mailing list issues@drill.apache.org Received: (qmail 91528 invoked by uid 99); 26 Aug 2016 06:29:20 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 26 Aug 2016 06:29:20 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 8030F2C0032 for ; Fri, 26 Aug 2016 06:29:20 +0000 (UTC) Date: Fri, 26 Aug 2016 06:29:20 +0000 (UTC) From: "Khurram Faraaz (JIRA)" To: issues@drill.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (DRILL-4862) wrong results - use of convert_from(binary_string(key),'UTF8') in filter results in wrong results MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Fri, 26 Aug 2016 06:29:22 -0000 [ https://issues.apache.org/jira/browse/DRILL-4862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15438542#comment-15438542 ] Khurram Faraaz commented on DRILL-4862: --------------------------------------- This test also returns wrong results, adding it here now, so that test gets added once the issue is fixed. {noformat} 0: jdbc:drill:schema=dfs.tmp> select key,convert_from(binary_string(key),'UTF8') from `f1.json`; +-------------------+---------+ | key | EXPR$1 | +-------------------+---------+ | 0123\x31\x32\x33 | 0123 | | 4567\x35\x36\x37 | 4567 | | 8901\x39\x30\x31 | 8901 | | 0000\x30\x30\x30 | 0000 | | 1111\x31\x31\x31 | 1111 | | 5555\x35\x35\x35 | 5555 | | 8888\x38\x38\x38 | 8888 | | 9999\x39\x39\x39 | 9999 | | ABCD\x42\x43\x44 | ABCD | | EFGH\x46\x47\x48 | EFGH | | IADF\x41\x44\x46 | IADF | | JKLM\x4b\x4c\x4d | JKLM | | WXYZ\x58\x59\x5a | WXYZ | | NOPQ\x4f\x50\x51 | NOPQ | | FFFF | FFFF | +-------------------+---------+ 15 rows selected (2.674 seconds) {noformat} > wrong results - use of convert_from(binary_string(key),'UTF8') in filter results in wrong results > ------------------------------------------------------------------------------------------------- > > Key: DRILL-4862 > URL: https://issues.apache.org/jira/browse/DRILL-4862 > Project: Apache Drill > Issue Type: Bug > Components: Execution - Flow > Affects Versions: 1.8.0 > Environment: 4 node cluster CentOS > Reporter: Khurram Faraaz > > These results do not look right, i.e when the predicate has convert_from(binary_string(key),'UTF8') > Apache drill 1.8.0-SNAPSHOT git commit ID: 57dc9f43 > {noformat} > [root@centos-0x drill4478]# cat f1.json > {"key":"\\x30\\x31\\x32\\x33"} > {"key":"\\x34\\x35\\x36\\x37"} > {"key":"\\x38\\x39\\x30\\x31"} > {"key":"\\x30\\x30\\x30\\x30"} > {"key":"\\x31\\x31\\x31\\x31"} > {"key":"\\x35\\x35\\x35\\x35"} > {"key":"\\x38\\x38\\x38\\x38"} > {"key":"\\x39\\x39\\x39\\x39"} > {"key":"\\x41\\x42\\x43\\x44"} > {"key":"\\x45\\x46\\x47\\x48"} > {"key":"\\x49\\x41\\x44\\x46"} > {"key":"\\x4a\\x4b\\x4c\\x4d"} > {"key":"\\x57\\x58\\x59\\x5a"} > {"key":"\\x4e\\x4f\\x50\\x51"} > {"key":"\\x46\\x46\\x46\\x46"} > {noformat} > results without the predicate - these are correct results > {noformat} > 0: jdbc:drill:schema=dfs.tmp> select convert_from(binary_string(key),'UTF8') from `f1.json`; > +---------+ > | EXPR$0 | > +---------+ > | 0123 | > | 4567 | > | 8901 | > | 0000 | > | 1111 | > | 5555 | > | 8888 | > | 9999 | > | ABCD | > | EFGH | > | IADF | > | JKLM | > | WXYZ | > | NOPQ | > | FFFF | > +---------+ > 15 rows selected (0.256 seconds) > {noformat} > results with a predicate - these results don't look correct > {noformat} > 0: jdbc:drill:schema=dfs.tmp> select convert_from(binary_string(key),'UTF8') from `f1.json` where convert_from(binary_string(key),'UTF8') is not null; > +----------+ > | EXPR$0 | > +----------+ > | 0123123 | > | 4567567 | > | 8901901 | > | 0000000 | > | 1111111 | > | 5555555 | > | 8888888 | > | 9999999 | > | ABCDBCD | > | EFGHFGH | > | IADFADF | > | JKLMKLM | > | WXYZXYZ | > | NOPQOPQ | > | FFFFFFF | > +----------+ > 15 rows selected (0.279 seconds) > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)