arrow-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "worker24h (JIRA)" <j...@apache.org>
Subject [jira] [Created] (ARROW-5282) Can't read data from parquet file in C++ library
Date Wed, 08 May 2019 01:39:00 GMT
worker24h created ARROW-5282:
--------------------------------

             Summary: Can't read data from parquet file in C++ library
                 Key: ARROW-5282
                 URL: https://issues.apache.org/jira/browse/ARROW-5282
             Project: Apache Arrow
          Issue Type: Bug
            Reporter: worker24h


Specified the second param *parquet::ReaderProperties* When I used parquet::ParquetFileReader::Open,
it can't work.
 The following code:
{code:java}
parquet::ReaderProperties _properties;
_properties = parquet::ReaderProperties(); 
_properties.enable_buffered_stream();  // used  buffer stream.  Don't set buffer-size
parquet_reader = parquet::ParquetFileReader::Open(_parquet, _properties);
...
int32_t value;
parquet::Int32Reader* int32_reader =
static_cast<parquet::Int32Reader*>(column_reader.get());
int32_reader->Skip(_current_line_of_group);// skip lines of processed.
rows_read = int32_reader->ReadBatch(1, nullptr, nullptr, &value, &values_read);
 

{code}
The interface *Skip* throw exception:

{color:#FF0000}{{Couldn't deserialize thrift: TProtocolException: Invalid data Deserializing
page header failed.}}{color}

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message