asterixdb-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Wenhai (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (ASTERIXDB-1412) Batch import errors
Date Sun, 24 Apr 2016 03:35:12 GMT

     [ https://issues.apache.org/jira/browse/ASTERIXDB-1412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Wenhai updated ASTERIXDB-1412:
------------------------------
    Description: 
When we importing about 2 million ACM datasets with the following schema, we got a data fields
mismatching error. While we splitting the records into 6 parts, the error disappears.
Schema
{noformat}
drop dataverse fuzzytest if exists;

create dataverse fuzzytest;

use dataverse fuzzytest;

create type PaperType as open {
  tid:uuid,
  title: string,
  authors: string?,
  year: int?,
  conf: string?,
  idx: string,
  abstract: string?
}

create dataset ACM(PaperType) primary key tid autogenerated;


  was:
When we importing about 2 million ACM datasets with the following schema, we got a data fields
mismatching error. While we splitting the records into 6 parts, the error disappears.
\noformat
drop dataverse fuzzytest if exists;

create dataverse fuzzytest;

use dataverse fuzzytest;

create type PaperType as open {
  tid:uuid,
  title: string,
  authors: string?,
  year: int?,
  conf: string?,
  idx: string,
  abstract: string?
}

create dataset ACM(PaperType) primary key tid autogenerated;


> Batch import errors
> -------------------
>
>                 Key: ASTERIXDB-1412
>                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-1412
>             Project: Apache AsterixDB
>          Issue Type: Bug
>          Components: Data Formats, Hyracks
>         Environment: Linux ubuntu12.04, 24cores, 128GB memory;
> Configure: 2NCs X 12partitions, 10GB/per NCs
>            Reporter: Wenhai
>
> When we importing about 2 million ACM datasets with the following schema, we got a data
fields mismatching error. While we splitting the records into 6 parts, the error disappears.
> Schema
> {noformat}
> drop dataverse fuzzytest if exists;
> create dataverse fuzzytest;
> use dataverse fuzzytest;
> create type PaperType as open {
>   tid:uuid,
>   title: string,
>   authors: string?,
>   year: int?,
>   conf: string?,
>   idx: string,
>   abstract: string?
> }
> create dataset ACM(PaperType) primary key tid autogenerated;



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message