db-derby-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rick Hillegas (JIRA)" <j...@apache.org>
Subject [jira] Commented: (DERBY-4555) Expand SYSCS_IMPORT_TABLE to accept CSV file with header lines
Date Fri, 19 Feb 2010 17:16:27 GMT

    [ https://issues.apache.org/jira/browse/DERBY-4555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12835830#action_12835830
] 

Rick Hillegas commented on DERBY-4555:
--------------------------------------

A more general solution to this problem might be to give the user the ability to turn off
logging for a while, maybe just for the duration of an INSERT. Then bulk inserts could enjoy
the performance advantages of the IMPORT procedures and be driven by arbitrary table functions.
For this particular enhancement, the table function would throw away the header line. The
general solution could be used to support table functions which siphon data out of foreign
data stores--that would eliminate the indirection through a CSV file altogether and make the
bulk insert even faster.

> Expand SYSCS_IMPORT_TABLE to accept CSV file with header lines
> --------------------------------------------------------------
>
>                 Key: DERBY-4555
>                 URL: https://issues.apache.org/jira/browse/DERBY-4555
>             Project: Derby
>          Issue Type: Bug
>          Components: Miscellaneous
>            Reporter: Yair Lenga
>
> The SYSCS_IMPORT_TABLE (and SYSCS_IMPORT_DATA) function allow import of data from external
resources. In general, they can process CSV files that created with various tools - with one
exception: the header line.
> While there is no accepted standard, most tools will include a header line in the CSV
file with column names. This convention is supported in Excel and many other tools.
> My Request: extend the SYSCS_IMPORT_TABLe and SYSCS_IMPORT_DATA (and other related procedures)
to include an extra indicator for the number of header lines to be ignored.
> As an extra bonus it will be help is the SYSCS_IMPORT_DATA will accept column names (instead
of column indexes) in the 'COLUMNINDEXES' arguments. E.g., it should be possible to indicate
COLUMNINDEXES of '1,3,sales,5,'. This feature will make it significantly easier to handle
cases where the external input files is extended to include additional columns.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message