spark-issues mailing list archives

From "Gengliang Wang (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SPARK-24330) Refactor ExecuteWriteTask in FileFormatWriter with DataWriter(V2)
Date Mon, 21 May 2018 16:27:00 GMT

     [ https://issues.apache.org/jira/browse/SPARK-24330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gengliang Wang updated SPARK-24330:
-----------------------------------
    Description: 
Refactor {{ExecuteWriteTask}} in {{FileFormatWriter}} to reduce duplicated logic and improve readability.
After the change, callers only need to call {{commit()}} or {{abort()}} at the end of a task.
{{SingleDirectoryWriteTask}} and {{DynamicPartitionWriteTask}} also contain less code.

The definitions of the related classes are moved to a new file, and {{ExecuteWriteTask}} is renamed
to {{FileFormatDataWriter}}.
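
To illustrate the caller-side contract described above, here is a minimal sketch. It is not the code in Spark: {{SketchDataWriter}}, {{InMemoryWriter}}, {{WriteTaskSketch}} and {{runTask}} are hypothetical names invented for this example, and the writer buffers rows in memory instead of writing files. It only assumes a writer that exposes write, commit and abort, so that the caller's whole responsibility is to finish with exactly one of {{commit()}} or {{abort()}}.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical, simplified stand-in for a DataWriter-style contract.
// It is NOT Spark's interface; it only mirrors the write/commit/abort shape.
interface SketchDataWriter<T, R> {
    void write(T record);
    R commit();
    void abort();
}

// Hypothetical writer that buffers rows in memory instead of writing files.
final class InMemoryWriter implements SketchDataWriter<String, Integer> {
    private final List<String> rows = new ArrayList<>();
    private boolean released = false;

    @Override
    public void write(String record) {
        if (record == null) {
            throw new IllegalArgumentException("null record");
        }
        rows.add(record);
    }

    // Marks resources as released and reports how many rows were written.
    @Override
    public Integer commit() {
        released = true;
        return rows.size();
    }

    // Discards buffered rows and marks resources as released.
    @Override
    public void abort() {
        rows.clear();
        released = true;
    }

    boolean isReleased() {
        return released;
    }

    int bufferedRows() {
        return rows.size();
    }
}

final class WriteTaskSketch {
    // Caller side: write every record, then end with commit() on success
    // or abort() on failure. Cleanup details stay inside the writer.
    static <T, R> R runTask(SketchDataWriter<T, R> writer, Iterable<T> records) {
        try {
            for (T record : records) {
                writer.write(record);
            }
            return writer.commit();
        } catch (RuntimeException e) {
            writer.abort();
            throw e;
        }
    }

    public static void main(String[] args) {
        int n = runTask(new InMemoryWriter(), Arrays.asList("a", "b", "c"));
        System.out.println("committed rows: " + n); // prints "committed rows: 3"
    }
}
```

In this sketch the per-writer cleanup lives behind {{commit()}} and {{abort()}}, so a second writer (for example, one that handles dynamic partitions) could reuse {{runTask}} unchanged.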

  was:
As I am working on the File data source V2 write path in my repo [https://github.com/gengliangwang/spark/tree/orcWriter],
I find it essential to refactor ExecuteWriteTask in FileFormatWriter with the DataWriter of Data
source V2:
 # Reuse the code in both `FileFormat` and Data Source V2
 # Better abstraction: callers only need to call `commit()` or `abort()` at the end of a task.


> Refactor ExecuteWriteTask in FileFormatWriter with DataWriter(V2)
> -----------------------------------------------------------------
>
>                 Key: SPARK-24330
>                 URL: https://issues.apache.org/jira/browse/SPARK-24330
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 2.3.1
>            Reporter: Gengliang Wang
>            Priority: Major
>
> Refactor {{ExecuteWriteTask}} in {{FileFormatWriter}} to reduce duplicated logic and improve readability.
> After the change, callers only need to call {{commit()}} or {{abort()}} at the end of a task.
> {{SingleDirectoryWriteTask}} and {{DynamicPartitionWriteTask}} also contain less code.
> The definitions of the related classes are moved to a new file, and {{ExecuteWriteTask}} is renamed to {{FileFormatDataWriter}}.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org

