db-derby-dev mailing list archives

From "Danoja Dias (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DERBY-4555) Expand SYSCS_IMPORT_TABLE to accept CSV file with header lines
Date Mon, 20 Jun 2016 11:05:05 GMT

    [ https://issues.apache.org/jira/browse/DERBY-4555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15339348#comment-15339348
] 

Danoja Dias commented on DERBY-4555:
------------------------------------

I think SYSCS_IMPORT_TABLE_BULK would be better. What do you think, Bryan?
Any suggestions for a name?

The second request is not included. Bryan, can we file a new JIRA for the second request,
since its implementation is different, or is it better to do it here?

When expanding SYSCS_IMPORT_DATA to accept files with header lines, we need
to add another system procedure for it.

When I ran the existing tests with the new patch, they were clean. I still need to add new
tests.

> Expand SYSCS_IMPORT_TABLE to accept CSV file with header lines
> --------------------------------------------------------------
>
>                 Key: DERBY-4555
>                 URL: https://issues.apache.org/jira/browse/DERBY-4555
>             Project: Derby
>          Issue Type: Improvement
>          Components: Miscellaneous
>            Reporter: Yair Lenga
>            Assignee: Danoja Dias
>         Attachments: NoVarargs.diff, Varargs.diff, addNewSystemProcedure_1.diff, gotException.diff,
hardCoded.diff, latest.diff, noHeaderLines.csv, petlist.csv, petlist.csv, petlist.csv, repro.java,
repro.java, repro.java, skipHeaders.diff
>
>
> The SYSCS_IMPORT_TABLE (and SYSCS_IMPORT_DATA) functions allow import of data from external
resources. In general, they can process CSV files that were created with various tools, with one
exception: the header line.
> While there is no accepted standard, most tools will include a header line in the CSV
file with column names. This convention is supported in Excel and many other tools.
> My request: extend SYSCS_IMPORT_TABLE and SYSCS_IMPORT_DATA (and other related procedures)
to include an extra indicator for the number of header lines to be ignored.
> As an extra bonus, it would be helpful if SYSCS_IMPORT_DATA accepted column names (instead
of column indexes) in the 'COLUMNINDEXES' argument. E.g., it should be possible to indicate
COLUMNINDEXES of '1,3,sales,5,'. This feature would make it significantly easier to handle
cases where the external input file is extended to include additional columns.
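
For illustration only, the two requests in the description above could be sketched
in plain Java roughly as follows. This is not Derby code and not taken from any of the
attached patches: the class name HeaderSkipSketch, both method names, and the naive
comma splitting (no quoted-field handling) are assumptions made for this sketch.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Derby: illustrates skipping header lines
// and mapping column names in a COLUMNINDEXES-style string to indexes.
class HeaderSkipSketch {

    /** Returns the lines that remain after dropping the first skipCount lines. */
    static List<String> skipHeaderLines(List<String> lines, int skipCount) {
        if (skipCount < 0) {
            throw new IllegalArgumentException("skipCount must be >= 0");
        }
        int from = Math.min(skipCount, lines.size());
        return new ArrayList<>(lines.subList(from, lines.size()));
    }

    /**
     * Rewrites a spec such as "1,3,sales,5," into 1-based indexes only,
     * looking up any non-numeric token in the header line.
     * Assumption: the header is split on plain commas, with no quoting.
     */
    static String resolveColumnIndexes(String spec, String headerLine) {
        List<String> header = new ArrayList<>();
        for (String name : headerLine.split(",")) {
            header.add(name.trim());
        }
        List<String> resolved = new ArrayList<>();
        for (String token : spec.split(",")) {
            String t = token.trim();
            if (t.isEmpty()) {
                continue; // tolerate empty entries such as a trailing comma
            }
            if (t.chars().allMatch(Character::isDigit)) {
                resolved.add(t); // already an index, keep as-is
            } else {
                int pos = indexOfIgnoreCase(header, t);
                if (pos < 0) {
                    throw new IllegalArgumentException("Unknown column: " + t);
                }
                resolved.add(String.valueOf(pos + 1)); // 1-based index
            }
        }
        return String.join(",", resolved);
    }

    /** Case-insensitive position of name in header, or -1 if absent. */
    private static int indexOfIgnoreCase(List<String> header, String name) {
        for (int i = 0; i < header.size(); i++) {
            if (header.get(i).equalsIgnoreCase(name)) {
                return i;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        List<String> lines = List.of("id,name,region,sales,date", "1,Rex,EU,10,2016-06-20");
        System.out.println(skipHeaderLines(lines, 1));
        System.out.println(resolveColumnIndexes("1,3,sales,5,", lines.get(0)));
    }
}
```

A real implementation inside the import procedures would also have to deal with quoted
fields, custom delimiters, and header lines that span several physical lines, which this
sketch leaves out.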



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
