hive-dev mailing list archives

From "Sarvesh Sakalanaga" <sarvesh.sakalan...@microsoft.com>
Subject Re: Review Request: Change ORC tree readers to return batches of rows instead of a row
Date Thu, 25 Apr 2013 01:00:57 GMT

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/10712/
-----------------------------------------------------------

(Updated April 25, 2013, 1 a.m.)


Review request for hive.


Changes
-------

Added Javadoc for the nextVector and nextBatch functions based on Eric's comments.


Description
-------

This patch changes the ORC reader to return a batch of rows instead of a single row. A new
method called nextBatch() is added to the ORC reader and to the ORC tree readers. Currently,
only the int, long, short, double, float, string, and struct types support batch processing.
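
To illustrate the difference between row-at-a-time and batch reading, here is a minimal,
self-contained sketch. It is not the code in this patch: BatchSketch, LongBatch, and
ToyLongReader are hypothetical names, the signature of nextBatch() here is an assumption
chosen for illustration, and an in-memory array stands in for a decoded ORC column stream.

```java
import java.util.NoSuchElementException;

/** Toy illustration only; these classes are hypothetical and are not Hive's. */
final class BatchSketch {

    /** A fixed-capacity column of longs plus the count of valid entries. */
    static final class LongBatch {
        final long[] values;
        int size;

        LongBatch(int capacity) {
            values = new long[capacity];
        }
    }

    /** Reads longs from an in-memory array that stands in for a column stream. */
    static final class ToyLongReader {
        private final long[] data;
        private int pos = 0;

        ToyLongReader(long[] data) {
            this.data = data;
        }

        /** Row-at-a-time access: one call per value. */
        long next() {
            if (pos >= data.length) {
                throw new NoSuchElementException("no more rows");
            }
            return data[pos++];
        }

        /**
         * Batch access: fills the caller's batch with up to its capacity
         * and returns the number of values read (0 at end of data).
         */
        int nextBatch(LongBatch batch) {
            int n = Math.min(batch.values.length, data.length - pos);
            System.arraycopy(data, pos, batch.values, 0, n);
            pos += n;
            batch.size = n;
            return n;
        }
    }

    /** Sums a column by reusing one batch object across calls. */
    static long sumByBatch(long[] data, int batchSize) {
        ToyLongReader reader = new ToyLongReader(data);
        LongBatch batch = new LongBatch(batchSize);
        long sum = 0;
        while (reader.nextBatch(batch) > 0) {
            for (int i = 0; i < batch.size; i++) {
                sum += batch.values[i];
            }
        }
        return sum;
    }

    public static void main(String[] args) {
        long[] data = {1, 2, 3, 4, 5, 6, 7};
        System.out.println(sumByBatch(data, 3));
    }
}
```

In this sketch the caller allocates the batch once and the reader refills it, so the inner
loop runs over a plain array rather than making one reader call per row.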


This addresses bug HIVE-4370.
    https://issues.apache.org/jira/browse/HIVE-4370


Diffs (updated)
-----

  ql/src/java/org/apache/hadoop/hive/ql/exec/vector/BytesColumnVector.java 246170d 
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/DynamicByteArray.java fc4e53b 
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/RecordReader.java 05240ce 
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/RecordReaderImpl.java d044cd8 
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/RunLengthIntegerReader.java 2825c64 
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestVectorizedORCReader.java PRE-CREATION 

Diff: https://reviews.apache.org/r/10712/diff/


Testing
-------


Thanks,

Sarvesh Sakalanaga

