Date: Mon, 10 Apr 2017 19:53:41 +0000 (UTC)
From: "Taewoo Kim (JIRA)"
Reply-To: dev@asterixdb.apache.org
To: notifications@asterixdb.incubator.apache.org
Subject: [jira] [Closed] (ASTERIXDB-1877) Full-text search throw exception

     [ https://issues.apache.org/jira/browse/ASTERIXDB-1877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Taewoo Kim closed ASTERIXDB-1877.
---------------------------------
    Resolution: Fixed

> Full-text search throw exception
> --------------------------------
>
>                 Key: ASTERIXDB-1877
>                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-1877
>             Project: Apache AsterixDB
>          Issue Type: Bug
>         Environment: master: {"git.tags":"","git.build.version":"0.9.1-SNAPSHOT","git.closest.tag.commit.count":"175","git.commit.id.abbrev":"39d09a0" ... }
>            Reporter: Jianfeng Jia
>            Assignee: Taewoo Kim
>
> Use the latest AsterixDB, run the cloudberry ingestion [script|https://github.com/JavierJia/cloudberry/blob/full-text/script/ingestTwitterToLocalCluster.sh].
> Only the following query works
> {code}
> for $t in dataset twitter.ds_tweet
> where ftcontains($t.'text', ['zika','virus'], {'mode':'all'})
> let $taggr := $t
> group by $g0 := get-interval-start-datetime(interval-bin($t.'create_at', datetime('1990-01-01T00:00:00.000Z'), day-time-duration("PT1H") )) with $taggr
> return {
>   'hour' : $g0,'count' : count($taggr)
> }
> {code}
> These queries will throw exceptions
> {code}
> for $t in dataset twitter.ds_tweet
> where ftcontains($t.'text', ['zika'], {'mode':'all'})
> let $taggr := $t
> group by $g0 := get-interval-start-datetime(interval-bin($t.'create_at', datetime('1990-01-01T00:00:00.000Z'), day-time-duration("PT1H") )) with $taggr
> return {
>   'hour' : $g0,'count' : count($taggr)
> }
> {code}
> {code}
> for $t in dataset twitter.ds_tweet
> where ftcontains($t.'text', 'zika')
> let $taggr := $t
> group by $g0 := get-interval-start-datetime(interval-bin($t.'create_at', datetime('1990-01-01T00:00:00.000Z'), day-time-duration("PT1H") )) with $taggr
> return {
>   'hour' : $g0,'count' : count($taggr)
> }
> {code}
> {code}
> for $t in dataset twitter.ds_tweet
> where ftcontains($t.'text', ['happy','day'])
> let $taggr := $t
> group by $g0 := get-interval-start-datetime(interval-bin($t.'create_at', datetime('1990-01-01T00:00:00.000Z'), day-time-duration("PT1H") )) with $taggr
> return {
>   'hour' : $g0,'count' : count($taggr)
> }
> {code}
> Exception :
> {code}
> Caused by: java.lang.IllegalArgumentException
>         at org.apache.hyracks.util.string.UTF8StringUtil.charAt(UTF8StringUtil.java:60)
>         at org.apache.hyracks.storage.am.lsm.invertedindex.tokenizers.DelimitedUTF8StringBinaryTokenizer.hasNext(DelimitedUTF8StringBinaryTokenizer.java:48)
>         at org.apache.asterix.runtime.evaluators.common.FullTextContainsEvaluator$_EvaluatorGen.readLeftAndConductSearch(FullTextContainsEvaluator.java:358)
>         at org.apache.asterix.runtime.evaluators.common.FullTextContainsEvaluator$_EvaluatorGen.fullTextContainsWithArg(FullTextContainsEvaluator.java:200)
>         at org.apache.asterix.runtime.evaluators.common.FullTextContainsEvaluator$_EvaluatorGen.evaluate(FullTextContainsEvaluator.java:158)
>         at org.apache.hyracks.algebricks.runtime.operators.std.StreamSelectRuntimeFactory$1.nextFrame(StreamSelectRuntimeFactory.java:133)
>         at org.apache.hyracks.dataflow.common.comm.io.AbstractFrameAppender.write(AbstractFrameAppender.java:92)
>         at org.apache.hyracks.dataflow.common.comm.util.FrameUtils.appendProjectionToWriter(FrameUtils.java:235)
>         at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.appendProjectionToFrame(AbstractOneInputOneOutputOneFramePushRuntime.java:98)
>         at org.apache.hyracks.algebricks.runtime.operators.base.AbstractOneInputOneOutputOneFramePushRuntime.appendProjectionToFrame(AbstractOneInputOneOutputOneFramePushRuntime.java:93)
>         at org.apache.hyracks.algebricks.runtime.operators.std.StreamProjectRuntimeFactory$1.nextFrame(StreamProjectRuntimeFactory.java:74)
>         at org.apache.hyracks.algebricks.runtime.operators.meta.AlgebricksMetaOperatorDescriptor$2.nextFrame(AlgebricksMetaOperatorDescriptor.java:134)
>         at org.apache.hyracks.dataflow.common.comm.io.AbstractFrameAppender.write(AbstractFrameAppender.java:92)
>         at org.apache.hyracks.storage.am.common.dataflow.IndexSearchOperatorNodePushable.close(IndexSearchOperatorNodePushable.java:204)
>         ... 10 more
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)