airflow-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF subversion and git services (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (AIRFLOW-1350) Add "query_uri" parameter for Google DataProc operator for both SparkSQL and Hive jobs
Date Tue, 27 Jun 2017 19:45:00 GMT

    [ https://issues.apache.org/jira/browse/AIRFLOW-1350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16065360#comment-16065360
] 

ASF subversion and git services commented on AIRFLOW-1350:
----------------------------------------------------------

Commit d32c7296908e6975c4dda7159c1a7a6b9e89f046 in incubator-airflow's branch refs/heads/master
from [~lfalsina]
[ https://git-wip-us.apache.org/repos/asf?p=incubator-airflow.git;h=d32c729 ]

[AIRFLOW-1350] Add query_uri param to Hive/SparkSQL DataProc operator

Closes #2402 from lukeFalsina/master


> Add "query_uri" parameter for Google DataProc operator for both SparkSQL and Hive jobs
> --------------------------------------------------------------------------------------
>
>                 Key: AIRFLOW-1350
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-1350
>             Project: Apache Airflow
>          Issue Type: Improvement
>          Components: gcp, operators
>            Reporter: Luca Falsina
>            Assignee: Luca Falsina
>             Fix For: 1.9.0
>
>
> Google DataProc APIs allow to provide as an input for some types of job (Pig, SparkSQL,
Hive) a "query_file" parameter pointing to a file stored in Google Cloud Storage.
> The current implementation of the DataProc operator allows this functionality only for
Pig jobs. The code modification should be trivial as I plan to just reuse the same implementation
in SparkSQL and Hive operator.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message