asterixdb-notifications mailing list archives

From "Ildar Absalyamov (JIRA)" <>
Subject [jira] [Created] (ASTERIXDB-2141) Pre-sorted bulkload failure
Date Tue, 24 Oct 2017 00:47:00 GMT
Ildar Absalyamov created ASTERIXDB-2141:

             Summary: Pre-sorted bulkload failure
                 Key: ASTERIXDB-2141
             Project: Apache AsterixDB
          Issue Type: Bug
            Reporter: Ildar Absalyamov
            Assignee: Ian Maxon

Bulk loading pre-sorted input fails due to a concurrency issue in the hash_partition_merge connector.
The following DDL produces the error "HYR0046: Unsorted load input".
The error is non-deterministic, but the chance of hitting it increases with the length of
the input.
drop dataverse experiments if exists;
create dataverse experiments;
use dataverse experiments;
set hash_merge "true";

create type TweetMessageType as open {
    tweetid: int64
};
create dataset Tweets(TweetMessageType) primary key tweetid;
load dataset Tweets using localfs (("path"="asterix_nc1://tweets.adm,asterix_nc2://tweets2.adm"),("format"="adm"));
The error occurs even though the input splits (tweets.adm and tweets2.adm) are each individually sorted.
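
To illustrate the invariant the load relies on, here is a minimal Python sketch. It is not AsterixDB code: the function names, the modulo hash, and the sample keys are all assumptions made up for illustration. It shows that hash-partitioning individually sorted splits and then merging the resulting streams per partition keeps each partition sorted, while simply appending the streams in arrival order (one hypothetical way a race could show up) trips a sortedness check.

```python
import heapq


def hash_partition(split, num_partitions):
    """Route each key of one sorted split to a partition.

    Keys keep their relative order, so every per-partition stream
    produced from a sorted split is itself sorted.
    """
    parts = [[] for _ in range(num_partitions)]
    for key in split:
        parts[key % num_partitions].append(key)  # hypothetical hash: key mod n
    return parts


def merge_partition(sorted_streams):
    """Order-preserving k-way merge of the sorted streams of one partition."""
    return list(heapq.merge(*sorted_streams))


def check_sorted(keys):
    """Hypothetical stand-in for the check behind 'Unsorted load input'."""
    for prev, cur in zip(keys, keys[1:]):
        if cur < prev:
            raise ValueError("Unsorted load input")


# Two individually sorted splits (made-up keys standing in for tweetid values).
split_a = [1, 4, 6, 9]
split_b = [2, 3, 7, 8]

parts_a = hash_partition(split_a, 2)
parts_b = hash_partition(split_b, 2)

# Merging the streams of each partition preserves the sort order.
merged = [merge_partition([parts_a[p], parts_b[p]]) for p in range(2)]
for partition in merged:
    check_sorted(partition)

# Appending the streams in arrival order does not.
concatenated = parts_a[0] + parts_b[0]
```

In this sketch `check_sorted(concatenated)` raises `ValueError`, because the even-key partition receives 4, 6 from the first split before 2, 8 from the second. Whether the actual failure in the connector follows this pattern is not established by the report.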

This message was sent by Atlassian JIRA