asterixdb-notifications mailing list archives

From "Taewoo Kim (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (ASTERIXDB-1208) ngram tokenizer failure with negative length
Date Wed, 02 Dec 2015 03:36:10 GMT

     [ https://issues.apache.org/jira/browse/ASTERIXDB-1208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Taewoo Kim reassigned ASTERIXDB-1208:
-------------------------------------

    Assignee: Taewoo Kim

> ngram tokenizer failure with negative length
> --------------------------------------------
>
>                 Key: ASTERIXDB-1208
>                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-1208
>             Project: Apache AsterixDB
>          Issue Type: Bug
>          Components: Hyracks Core
>            Reporter: Wenhai
>            Assignee: Taewoo Kim
>
> drop dataverse test if exists;
> create dataverse test;
> use dataverse test;
> create type DBLPOpenType as open {
>   id: int64,
>   dblpid: string,
>   authors: string,
>   misc: string
> }
> create dataset DBLPOpen(DBLPOpenType) primary key id;
> insert into dataset DBLPOpen { "id": 93, "dblpid": "journals/iandc/IbarraJCR91", "authors": "Some Classes of Languages in NC¹", "misc": "2006-04-25 86-106 Inf. Comput. January 1991 90 1 db/journals/iandc/iandc90.html#IbarraJCR91" }
> use dataverse test;
> set import-private-functions 'true'
> for $d in dataset DBLPOpen
> where similarity-jaccard(gram-tokens("",3,false),gram-tokens($d.title,3,false)) >= 0.5
> return {"rec": $d}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
