airflow-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Siddharth Anand (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (AIRFLOW-50) cx_Oracle insert is not performant
Date Thu, 05 May 2016 15:47:12 GMT

    [ https://issues.apache.org/jira/browse/AIRFLOW-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15272533#comment-15272533
] 

Siddharth Anand commented on AIRFLOW-50:
----------------------------------------

I see an OracleOperator, but that is calling the generic DBApiHook.run().

The OracleHook's 2 methods : *insert_rows* and *bulk_insert_rows* at a quick glance aren't
exposed through the current operator, so I"m guessing he is just using the hook directly in
a python callable from a PythonOperator. 

-s 

> cx_Oracle insert is not performant 
> -----------------------------------
>
>                 Key: AIRFLOW-50
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-50
>             Project: Apache Airflow
>          Issue Type: Bug
>          Components: hooks
>    Affects Versions: Airflow 1.7.0
>         Environment: Airflow version: 1.7.0
> Airflow components: OracleHook
> Python Version: 2.7.10/3.4
> Operating System: Linux Centos7 / Mac
>            Reporter: Nam Ngo
>              Labels: performance
>   Original Estimate: 2h
>  Remaining Estimate: 2h
>
> What did you expect to happen? OracleHook should allow me to insert 1 million rows quickly
(2-4min)
> What happened instead? It's very slow.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message