arrow-dev mailing list archives

From "Ernesto Ocampo (JIRA)" <j...@apache.org>
Subject [jira] [Created] (ARROW-2392) pyarrow RecordBatchStreamWriter allows writing batches with different schemas
Date Wed, 04 Apr 2018 16:37:00 GMT
Ernesto Ocampo created ARROW-2392:
-------------------------------------

             Summary: pyarrow RecordBatchStreamWriter allows writing batches with different schemas
                 Key: ARROW-2392
                 URL: https://issues.apache.org/jira/browse/ARROW-2392
             Project: Apache Arrow
          Issue Type: Bug
            Reporter: Ernesto Ocampo


A RecordBatchStreamWriter initialised with a given schema still accepts RecordBatches whose
schema differs from the writer's, without raising any error. Example:

 
{code:java}
import pyarrow as pa

# The writer is created for a schema with a single int64 field.
schema = pa.schema([pa.field('some_field', pa.int64())])
stream = pa.BufferOutputStream()
writer = pa.RecordBatchStreamWriter(stream, schema)

# A float value is inferred as float64, so this batch does not conform to schema.
data = [pa.array([1.234])]
batch = pa.RecordBatch.from_arrays(data, ['some_field'])

assert batch.schema != schema

writer.write_batch(batch)  # no exception raised
writer.close()
{code}
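
Until the writer validates this itself, a caller-side guard is one possible workaround. The sketch below is only an illustration: `write_batch_checked` is a hypothetical helper name, not part of pyarrow. It assumes only that the batch exposes `.schema`, that the schema has an `equals()` method, and that the writer has `write_batch()`, as the objects in the example above do.

```python
def write_batch_checked(writer, schema, batch):
    """Write `batch` only if its schema equals the schema the writer was created with.

    Hypothetical helper (not a pyarrow API). `schema` must be the same schema
    that was passed to the RecordBatchStreamWriter constructor.
    """
    if not batch.schema.equals(schema):
        # Refuse to write instead of silently producing an inconsistent stream.
        raise ValueError(
            "batch schema does not match writer schema:\n"
            "expected: {}\n"
            "got: {}".format(schema, batch.schema)
        )
    writer.write_batch(batch)
```

In the example above, calling `write_batch_checked(writer, schema, batch)` in place of `writer.write_batch(batch)` would be expected to raise `ValueError`, since the batch holds a float64 column while the writer's schema declares int64.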



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
