airflow-commits mailing list archives

From "ASF subversion and git services (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (AIRFLOW-1882) Add ignoreUnknownValues option to gcs_to_bq operator
Date Fri, 16 Feb 2018 11:37:00 GMT

    [ https://issues.apache.org/jira/browse/AIRFLOW-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16366878#comment-16366878 ]

ASF subversion and git services commented on AIRFLOW-1882:
----------------------------------------------------------

Commit c739adc623818287e8e7de7017aa3a2af085912e in incubator-airflow's branch refs/heads/master
from [~kaxilnaik]
[ https://git-wip-us.apache.org/repos/asf?p=incubator-airflow.git;h=c739adc ]

[AIRFLOW-1882] Add ignoreUnknownValues option to gcs_to_bq operator

- Added `ignore_unknown_values` to `run_load` method in `BigQuery Hook`
- Added `ignore_unknown_values` to `GoogleCloudStorageToBigQueryOperator`

Closes #3042 from kaxil/AIRFLOW-1882
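
As a rough illustration of what the commit describes, the sketch below shows how a snake_case `ignore_unknown_values` argument could be passed through to the `ignoreUnknownValues` field of a BigQuery load-job configuration. The helper name `build_load_config` and its parameters are hypothetical and are not taken from the Airflow source; only the camelCase field names follow the BigQuery load configuration.

```python
def build_load_config(project_id, dataset_id, table_id, source_uris,
                      ignore_unknown_values=False):
    """Hypothetical helper: build a BigQuery load-job configuration dict.

    This is not the Airflow implementation, only a sketch of how the
    option could be mapped onto the REST field of the same meaning.
    """
    return {
        "load": {
            "destinationTable": {
                "projectId": project_id,
                "datasetId": dataset_id,
                "tableId": table_id,
            },
            "sourceFormat": "CSV",
            "sourceUris": list(source_uris),
            # The snake_case argument maps to the camelCase REST field.
            "ignoreUnknownValues": ignore_unknown_values,
        }
    }


config = build_load_config(
    "my-project", "my_dataset", "my_table",
    ["gs://my-bucket/data.csv"],  # placeholder URI for illustration
    ignore_unknown_values=True,
)
```

Defaulting the argument to `False` in this sketch keeps existing callers unaffected unless they opt in.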


> Add ignoreUnknownValues option to gcs_to_bq operator
> ----------------------------------------------------
>
>                 Key: AIRFLOW-1882
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-1882
>             Project: Apache Airflow
>          Issue Type: Improvement
>          Components: contrib, gcp
>    Affects Versions: 1.8.2
>            Reporter: Yannick Einsweiler
>            Assignee: Kaxil Naik
>            Priority: Major
>              Labels: gcp
>
> Would allow loading CSV files that have columns not defined in the schema, for instance
> when lines end with a dummy/extra separator. BigQuery treats the trailing separator as an
> extra column and won't load the file unless the option is passed.
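
The trailing-separator case from the description can be reproduced locally with Python's standard `csv` module. This is only a sketch: the two-column schema and the sample rows are made up, and the trimming step imitates the effect of ignoring extra trailing values rather than showing BigQuery's own behavior.

```python
import csv
import io

schema = ["id", "name"]            # hypothetical two-column schema
data = "1,alice,\n2,bob,\n"        # each line ends with an extra separator


def extra_values(text, columns):
    """Return (line_number, extra_field_count) for rows wider than the schema."""
    found = []
    for line_number, row in enumerate(csv.reader(io.StringIO(text)), start=1):
        if len(row) > len(columns):
            found.append((line_number, len(row) - len(columns)))
    return found


def trim_rows(text, columns):
    """Drop trailing fields beyond the schema width, keeping the defined columns."""
    return [row[:len(columns)] for row in csv.reader(io.StringIO(text))]


print(extra_values(data, schema))  # rows that carry an extra trailing field
print(trim_rows(data, schema))     # rows cut down to the schema width
```

Each sample line parses into three fields because the final comma introduces an empty third value, which is why a strict two-column load would reject it.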



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
