spark-reviews mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From gengliangwang <...@git.apache.org>
Subject [GitHub] spark pull request #21829: [SPARK-24876][SQL] Avro: simplify schema serializ...
Date Fri, 20 Jul 2018 13:32:36 GMT
GitHub user gengliangwang opened a pull request:

    https://github.com/apache/spark/pull/21829

    [SPARK-24876][SQL] Avro: simplify schema serialization

    ## What changes were proposed in this pull request?
    
    Previously in the refactoring of Avro Serializer and Deserializer, a new class SerializableSchema
is created for serializing the Avro schema:
    https://github.com/apache/spark/pull/21762/files#diff-01fea32e6ec6bcf6f34d06282e08705aR37
    
    On second thought, we can use `toString` method for serialization. After that, parse the
JSON format schema on executor. This makes the code much simpler.
    
    ## How was this patch tested?
    
    Unit test


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/gengliangwang/spark removeSerializableSchema

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/21829.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #21829
    
----
commit f3765031d5f0126780e7c823301392063dea1d2f
Author: Gengliang Wang <gengliang.wang@...>
Date:   2018-07-20T13:06:39Z

    remove SerializableSchema

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


Mime
View raw message