arrow-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "worker24h (JIRA)" <>
Subject [jira] [Created] (ARROW-5282) Can't read data from parquet file in C++ library
Date Wed, 08 May 2019 01:39:00 GMT
worker24h created ARROW-5282:

             Summary: Can't read data from parquet file in C++ library
                 Key: ARROW-5282
             Project: Apache Arrow
          Issue Type: Bug
            Reporter: worker24h

Specified the second param *parquet::ReaderProperties* When I used parquet::ParquetFileReader::Open,
it can't work.
 The following code:
parquet::ReaderProperties _properties;
_properties = parquet::ReaderProperties(); 
_properties.enable_buffered_stream();  // used  buffer stream.  Don't set buffer-size
parquet_reader = parquet::ParquetFileReader::Open(_parquet, _properties);
int32_t value;
parquet::Int32Reader* int32_reader =
int32_reader->Skip(_current_line_of_group);// skip lines of processed.
rows_read = int32_reader->ReadBatch(1, nullptr, nullptr, &value, &values_read);

The interface *Skip* throw exception:

{color:#FF0000}{{Couldn't deserialize thrift: TProtocolException: Invalid data Deserializing
page header failed.}}{color}


This message was sent by Atlassian JIRA

View raw message