Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id F0606200B46 for ; Sat, 11 Jun 2016 03:08:22 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id ED3CF160A5A; Sat, 11 Jun 2016 01:08:22 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 3FE9B160A38 for ; Sat, 11 Jun 2016 03:08:22 +0200 (CEST) Received: (qmail 78654 invoked by uid 500); 11 Jun 2016 01:08:21 -0000 Mailing-List: contact derby-dev-help@db.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: Delivered-To: mailing list derby-dev@db.apache.org Received: (qmail 78624 invoked by uid 99); 11 Jun 2016 01:08:21 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 11 Jun 2016 01:08:21 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 238492C1F68 for ; Sat, 11 Jun 2016 01:08:21 +0000 (UTC) Date: Sat, 11 Jun 2016 01:08:21 +0000 (UTC) From: "Bryan Pendleton (JIRA)" To: derby-dev@db.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (DERBY-4555) Expand SYSCS_IMPORT_TABLE to accept CSV file with header lines MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Sat, 11 Jun 2016 01:08:23 -0000 [ https://issues.apache.org/jira/browse/DERBY-4555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Pendleton updated DERBY-4555: ----------------------------------- Attachment: repro.java noHeaderLines.csv Attached is the repro.java that I am using, and the noHeaderLines.csv file that I used. I also added a 'println' to SystemProcedures.SYSCS_IMPORT_TABLE so I could see that I was running the code I thought I was running. When I run with that configuration, with Varargs.diff applied and built, I see: [bpendleton@Fedora22 DERBY-4555]$ rm -rf derbyDB/ ; java -cp /home/bpendleton/derby/trunk/classes:. repro repro starting in embedded mode Created database derbyDB Created table A SYSCS_IMPORT_TABLE: replace[0]=0 Table is imported Rover, Dog,4 Spot, cat,2 Squawky, Parrot,37 repro finished > Expand SYSCS_IMPORT_TABLE to accept CSV file with header lines > -------------------------------------------------------------- > > Key: DERBY-4555 > URL: https://issues.apache.org/jira/browse/DERBY-4555 > Project: Derby > Issue Type: Improvement > Components: Miscellaneous > Reporter: Yair Lenga > Assignee: Danoja Dias > Attachments: NoVarargs.diff, Varargs.diff, gotException.diff, hardCoded.diff, latest.diff, noHeaderLines.csv, petlist.csv, petlist.csv, petlist.csv, repro.java, repro.java > > > The SYSCS_IMPORT_TABLE (and SYSCS_IMPORT_DATA) function allow import of data from external resources. In general, they can process CSV files that created with various tools - with one exception: the header line. > While there is no accepted standard, most tools will include a header line in the CSV file with column names. This convention is supported in Excel and many other tools. > My Request: extend the SYSCS_IMPORT_TABLe and SYSCS_IMPORT_DATA (and other related procedures) to include an extra indicator for the number of header lines to be ignored. > As an extra bonus it will be help is the SYSCS_IMPORT_DATA will accept column names (instead of column indexes) in the 'COLUMNINDEXES' arguments. E.g., it should be possible to indicate COLUMNINDEXES of '1,3,sales,5,'. This feature will make it significantly easier to handle cases where the external input files is extended to include additional columns. -- This message was sent by Atlassian JIRA (v6.3.4#6332)