Date: Sun, 8 Nov 2015 21:57:11 +0000 (UTC)
From: "Cameron Samak (JIRA)"
To: notifications@asterixdb.incubator.apache.org
Subject: [jira] [Updated] (ASTERIXDB-1164) Indeterministic string codepoint function behavior

     [ https://issues.apache.org/jira/browse/ASTERIXDB-1164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Cameron Samak updated ASTERIXDB-1164:
-------------------------------------
    Description:
Repeatedly executing the same codepoint-to-string(string-to-codepoint($u.name)) query shown below results in completely different output. Two examples follow the offending query.
I was unable to reproduce after a few tries with just codepoint-to-string.
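
For comparison, the round trip is expected to be the identity: converting a string to its code points and back should return the same string with nothing appended. The Python sketch below illustrates that expectation using values copied from the sample output in this report. The helper names (string_to_codepoints, codepoints_to_string, round_trip_ok) are hypothetical stand-ins, not AsterixDB functions, and the sketch makes no claim about where the stray characters come from.

```python
# Hypothetical helpers for illustration only; these are not AsterixDB APIs.

def string_to_codepoints(s):
    """Return the list of Unicode code points of s."""
    return [ord(ch) for ch in s]


def codepoints_to_string(codepoints):
    """Build a string from a list of Unicode code points."""
    return "".join(chr(cp) for cp in codepoints)


def round_trip_ok(observed, codepoints):
    """True when the observed string is exactly what the code points encode."""
    return observed == codepoints_to_string(codepoints)


# Code point lists copied from the sample output in this report.
willis = [87, 105, 108, 108, 105, 115, 87, 121, 110, 110, 101]
nila = [78, 105, 108, 97, 77, 105, 108, 108, 105, 114, 111, 110]

# Strings the code points actually encode.
expected_willis = codepoints_to_string(willis)
expected_nila = codepoints_to_string(nila)

# Observed "string" values from the two runs in this report.
observed = [
    ("WilliiWynne\u0004", willis),                        # first run
    ("WillisWynne", willis),                              # second run
    ("WilaMilyiron", nila),                               # first run
    ("NilaMilliron\u0004\u0000\u0000\u0000j", nila),      # second run
]

for text, cps in observed:
    status = "ok" if round_trip_ok(text, cps) else "MISMATCH"
    print(f"{status}: {text!r} vs {codepoints_to_string(cps)!r}")
```

Only the second-run "WillisWynne" value passes this check; the other three observed strings differ from what their own code point lists encode.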
Query:
use dataverse TinySocial;
for $u in dataset FacebookUsers
let $codepoint := string-to-codepoint($u.name)
return {"string" : codepoint-to-string($codepoint), "codepoint": $codepoint}

Sample Output:
{code:JavaScript}
[ { "string": "WilliiWynne\u0004", "codepoint": [ 87, 105, 108, 108, 105, 115, 87, 121, 110, 110, 101 ] }
, { "string": "MargaritaStoddard", "codepoint": [ 77, 97, 114, 103, 97, 114, 105, 116, 97, 83, 116, 111, 100, 100, 97, 114, 100 ] }
, { "string": "IsbelDull", "codepoint": [ 73, 115, 98, 101, 108, 68, 117, 108, 108 ] }
, { "string": "NicholasStroh", "codepoint": [ 78, 105, 99, 104, 111, 108, 97, 115, 83, 116, 114, 111, 104 ] }
, { "string": "WilaMilyiron", "codepoint": [ 78, 105, 108, 97, 77, 105, 108, 108, 105, 114, 111, 110 ] }
, { "string": "WoodrowNehling", "codepoint": [ 87, 111, 111, 100, 114, 111, 119, 78, 101, 104, 108, 105, 110, 103 ] }
, { "string": "BramHatch", "codepoint": [ 66, 114, 97, 109, 72, 97, 116, 99, 104 ] }
, { "string": "EmoryUnk", "codepoint": [ 69, 109, 111, 114, 121, 85, 110, 107 ] }
, { "string": "VonKemble", "codepoint": [ 86, 111, 110, 75, 101, 109, 98, 108, 101 ] }
, { "string": "SuzannaTillson", "codepoint": [ 83, 117, 122, 97, 110, 110, 97, 84, 105, 108, 108, 115, 111, 110 ] }
]
[ { "string": "WillisWynne", "codepoint": [ 87, 105, 108, 108, 105, 115, 87, 121, 110, 110, 101 ] }
, { "string": "MargaritaStoddard", "codepoint": [ 77, 97, 114, 103, 97, 114, 105, 116, 97, 83, 116, 111, 100, 100, 97, 114, 100 ] }
, { "string": "IsbelDull", "codepoint": [ 73, 115, 98, 101, 108, 68, 117, 108, 108 ] }
, { "string": "NicholasStroh", "codepoint": [ 78, 105, 99, 104, 111, 108, 97, 115, 83, 116, 114, 111, 104 ] }
, { "string": "NilaMilliron\u0004\u0000\u0000\u0000j", "codepoint": [ 78, 105, 108, 97, 77, 105, 108, 108, 105, 114, 111, 110 ] }
, { "string": "WoodrowNehling", "codepoint": [ 87, 111, 111, 100, 114, 111, 119, 78, 101, 104, 108, 105, 110, 103 ] }
, { "string": "BramHatch", "codepoint": [ 66, 114, 97, 109, 72, 97, 116, 99, 104 ] }
, { "string": "EmoryUnk", "codepoint": [ 69, 109, 111, 114, 121, 85, 110, 107 ] }
, { "string": "VonKemble", "codepoint": [ 86, 111, 110, 75, 101, 109, 98, 108, 101 ] }
, { "string": "SuzannaTillson", "codepoint": [ 83, 117, 122, 97, 110, 110, 97, 84, 105, 108, 108, 115, 111, 110 ] }
]
{code}

  was:
Repeatedly executing the same codepoint-to-string(string-to-codepoint($u.name)) query shown below results in completely different output. Two examples follow the offending query.
I was unable to reproduce after a few tries with just codepoint-to-string.

Query:
use dataverse TinySocial;
for $u in dataset FacebookUsers
let $codepoint := string-to-codepoint($u.name)
return {"string" : codepoint-to-string($codepoint), "codepoint": $codepoint}

Sample Output:
[ { "string": "WilliiWynne\u0004", "codepoint": [ 87, 105, 108, 108, 105, 115, 87, 121, 110, 110, 101 ] }
, { "string": "MargaritaStoddard", "codepoint": [ 77, 97, 114, 103, 97, 114, 105, 116, 97, 83, 116, 111, 100, 100, 97, 114, 100 ] }
, { "string": "IsbelDull", "codepoint": [ 73, 115, 98, 101, 108, 68, 117, 108, 108 ] }
, { "string": "NicholasStroh", "codepoint": [ 78, 105, 99, 104, 111, 108, 97, 115, 83, 116, 114, 111, 104 ] }
, { "string": "WilaMilyiron", "codepoint": [ 78, 105, 108, 97, 77, 105, 108, 108, 105, 114, 111, 110 ] }
, { "string": "WoodrowNehling", "codepoint": [ 87, 111, 111, 100, 114, 111, 119, 78, 101, 104, 108, 105, 110, 103 ] }
, { "string": "BramHatch", "codepoint": [ 66, 114, 97, 109, 72, 97, 116, 99, 104 ] }
, { "string": "EmoryUnk", "codepoint": [ 69, 109, 111, 114, 121, 85, 110, 107 ] }
, { "string": "VonKemble", "codepoint": [ 86, 111, 110, 75, 101, 109, 98, 108, 101 ] }
, { "string": "SuzannaTillson", "codepoint": [ 83, 117, 122, 97, 110, 110, 97, 84, 105, 108, 108, 115, 111, 110 ] }
]
[ { "string": "WillisWynne", "codepoint": [ 87, 105, 108, 108, 105, 115, 87, 121, 110, 110, 101 ] }
, { "string": "MargaritaStoddard", "codepoint": [ 77, 97, 114, 103, 97, 114, 105, 116, 97, 83, 116, 111, 100, 100, 97, 114, 100 ] }
, { "string": "IsbelDull", "codepoint": [ 73, 115, 98, 101, 108, 68, 117, 108, 108 ] }
, { "string": "NicholasStroh", "codepoint": [ 78, 105, 99, 104, 111, 108, 97, 115, 83, 116, 114, 111, 104 ] }
, { "string": "NilaMilliron\u0004\u0000\u0000\u0000j", "codepoint": [ 78, 105, 108, 97, 77, 105, 108, 108, 105, 114, 111, 110 ] }
, { "string": "WoodrowNehling", "codepoint": [ 87, 111, 111, 100, 114, 111, 119, 78, 101, 104, 108, 105, 110, 103 ] }
, { "string": "BramHatch", "codepoint": [ 66, 114, 97, 109, 72, 97, 116, 99, 104 ] }
, { "string": "EmoryUnk", "codepoint": [ 69, 109, 111, 114, 121, 85, 110, 107 ] }
, { "string": "VonKemble", "codepoint": [ 86, 111, 110, 75, 101, 109, 98, 108, 101 ] }
, { "string": "SuzannaTillson", "codepoint": [ 83, 117, 122, 97, 110, 110, 97, 84, 105, 108, 108, 115, 111, 110 ] }
]


> Indeterministic string codepoint function behavior
> --------------------------------------------------
>
>                 Key: ASTERIXDB-1164
>                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-1164
>             Project: Apache AsterixDB
>          Issue Type: Bug
>          Components: Functions - AQL
>         Environment: master (97dd45d2d4426bcb1a3971f021839dbfd7993fc2) on Windows
>            Reporter: Cameron Samak
>
> Repeatedly executing the same codepoint-to-string(string-to-codepoint($u.name)) query shown below results in completely different output. Two examples follow the offending query.
> I was unable to reproduce after a few tries with just codepoint-to-string.
> Query:
> use dataverse TinySocial;
> for $u in dataset FacebookUsers
> let $codepoint := string-to-codepoint($u.name)
> return {"string" : codepoint-to-string($codepoint), "codepoint": $codepoint}
> Sample Output:
> {code:JavaScript}
> [ { "string": "WilliiWynne\u0004", "codepoint": [ 87, 105, 108, 108, 105, 115, 87, 121, 110, 110, 101 ] }
> , { "string": "MargaritaStoddard", "codepoint": [ 77, 97, 114, 103, 97, 114, 105, 116, 97, 83, 116, 111, 100, 100, 97, 114, 100 ] }
> , { "string": "IsbelDull", "codepoint": [ 73, 115, 98, 101, 108, 68, 117, 108, 108 ] }
> , { "string": "NicholasStroh", "codepoint": [ 78, 105, 99, 104, 111, 108, 97, 115, 83, 116, 114, 111, 104 ] }
> , { "string": "WilaMilyiron", "codepoint": [ 78, 105, 108, 97, 77, 105, 108, 108, 105, 114, 111, 110 ] }
> , { "string": "WoodrowNehling", "codepoint": [ 87, 111, 111, 100, 114, 111, 119, 78, 101, 104, 108, 105, 110, 103 ] }
> , { "string": "BramHatch", "codepoint": [ 66, 114, 97, 109, 72, 97, 116, 99, 104 ] }
> , { "string": "EmoryUnk", "codepoint": [ 69, 109, 111, 114, 121, 85, 110, 107 ] }
> , { "string": "VonKemble", "codepoint": [ 86, 111, 110, 75, 101, 109, 98, 108, 101 ] }
> , { "string": "SuzannaTillson", "codepoint": [ 83, 117, 122, 97, 110, 110, 97, 84, 105, 108, 108, 115, 111, 110 ] }
> ]
> [ { "string": "WillisWynne", "codepoint": [ 87, 105, 108, 108, 105, 115, 87, 121, 110, 110, 101 ] }
> , { "string": "MargaritaStoddard", "codepoint": [ 77, 97, 114, 103, 97, 114, 105, 116, 97, 83, 116, 111, 100, 100, 97, 114, 100 ] }
> , { "string": "IsbelDull", "codepoint": [ 73, 115, 98, 101, 108, 68, 117, 108, 108 ] }
> , { "string": "NicholasStroh", "codepoint": [ 78, 105, 99, 104, 111, 108, 97, 115, 83, 116, 114, 111, 104 ] }
> , { "string": "NilaMilliron\u0004\u0000\u0000\u0000j", "codepoint": [ 78, 105, 108, 97, 77, 105, 108, 108, 105, 114, 111, 110 ] }
> , { "string": "WoodrowNehling", "codepoint": [ 87, 111, 111, 100, 114, 111, 119, 78, 101, 104, 108, 105, 110, 103 ] }
> , { "string": "BramHatch", "codepoint": [ 66, 114, 97, 109, 72, 97, 116, 99, 104 ] }
> , { "string": "EmoryUnk", "codepoint": [ 69, 109, 111, 114, 121, 85, 110, 107 ] }
> , { "string": "VonKemble", "codepoint": [ 86, 111, 110, 75, 101, 109, 98, 108, 101 ] }
> , { "string": "SuzannaTillson", "codepoint": [ 83, 117, 122, 97, 110, 110, 97, 84, 105, 108, 108, 115, 111, 110 ] }
> ]
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)