pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dmitriy V. Ryaboy (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PIG-3621) Python Avro library can't read Avros made with builtin AvroStorage
Date Thu, 19 Dec 2013 02:24:10 GMT

    [ https://issues.apache.org/jira/browse/PIG-3621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13852500#comment-13852500
] 

Dmitriy V. Ryaboy commented on PIG-3621:
----------------------------------------

+1

> Python Avro library can't read Avros made with builtin AvroStorage
> ------------------------------------------------------------------
>
>                 Key: PIG-3621
>                 URL: https://issues.apache.org/jira/browse/PIG-3621
>             Project: Pig
>          Issue Type: Bug
>          Components: internal-udfs
>    Affects Versions: 0.12.0
>            Reporter: Russell Jurney
>             Fix For: 0.12.1, 0.13.0
>
>         Attachments: PIG-3621-3.patch, PIG-3631-2.patch, PIG-3631.patch
>
>
> Using this script:
> from avro import schema, datafile, io
> import pprint
> import sys
> import json
> field_id = None
> # Optional key to print
> if (len(sys.argv) > 2):
>   field_id = sys.argv[2]
> # Test reading avros
> rec_reader = io.DatumReader()
> # Create a 'data file' (avro file) reader
> df_reader = datafile.DataFileReader(
>   open(sys.argv[1]),
>   rec_reader
> )
> the last line fails with:
> Traceback (most recent call last):
>   File "/Users/rjurney/bin/cat_avro", line 22, in <module>
>     rec_reader
>   File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/avro/datafile.py",
line 247, in __init__
>     self.datum_reader.writers_schema = schema.parse(self.get_meta(SCHEMA_KEY))
>   File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/avro/schema.py",
line 784, in parse
>     return make_avsc_object(json_data, names)
>   File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/avro/schema.py",
line 740, in make_avsc_object
>     return RecordSchema(name, namespace, fields, names, type, doc, other_props)
>   File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/avro/schema.py",
line 653, in __init__
>     other_props)
>   File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/avro/schema.py",
line 294, in __init__
>     new_name = names.add_name(name, namespace, self)
>   File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/avro/schema.py",
line 268, in add_name
>     raise SchemaParseException(fail_msg)
> avro.schema.SchemaParseException: record is a reserved type name.



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

Mime
View raw message