avro-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Grant Rodgers (JIRA)" <j...@apache.org>
Subject [jira] Updated: (AVRO-554) data files created by ruby DataWriter are extremely large
Date Thu, 27 May 2010 18:08:37 GMT

     [ https://issues.apache.org/jira/browse/AVRO-554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Grant Rodgers updated AVRO-554:
-------------------------------

    Attachment: data3000.avr.gz

Doing some more tests:

1000 records filesize: 7233
2000 records filesize: 14234
3000 records filesize: 13242729

Attached is the file with 3000 records (gzipped)

> data files created by ruby DataWriter are extremely large
> ---------------------------------------------------------
>
>                 Key: AVRO-554
>                 URL: https://issues.apache.org/jira/browse/AVRO-554
>             Project: Avro
>          Issue Type: Bug
>    Affects Versions: 1.3.0, 1.4.0
>         Environment: avro-1.4.0-pre1, ruby 1.8.7 (2010-01-10 patchlevel 249) [x86_64-linux]
>            Reporter: Grant Rodgers
>         Attachments: avro_comp.rb, data10.avr, data100.avr, data3000.avr.gz
>
>
> Adding 10000 records of a very simple schema (3 fields) to a DataWriter results in a
file that is 317mb.  The same records in JSON are 430k.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message