hive-dev mailing list archives

From "Prasanth J (JIRA)" <>
Subject [jira] [Commented] (HIVE-6060) Define API for RecordUpdater and UpdateReader
Date Tue, 11 Mar 2014 22:59:43 GMT


Prasanth J commented on HIVE-6060:

[~owen.omalley] HIVE-6578 added partialscan and noscan support to the analyze statement
for ORC files. When an analyze command with partialscan or noscan is executed, each partition
directory is iterated and an ORC reader is created for each file under that directory. Basic
statistics such as number of rows, file size, and raw data size are computed by reading stats
from the ORC file footer. How do the HIVE-5317 and HIVE-6060 changes affect the way HIVE-6578
gathers stats?
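
To make the question concrete, here is a minimal sketch of the footer-based aggregation described above. It is not the HIVE-6578 code. FileFooter, PartitionStats, aggregate, and gather are hypothetical names used only for illustration, and the footers are plain in-memory values standing in for what an ORC reader would return for each file.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class FooterStatsSketch {

    /** Hypothetical stand-in for the basic stats kept in one ORC file footer. */
    public static final class FileFooter {
        public final long numRows;
        public final long rawDataSize;
        public final long fileSize;

        public FileFooter(long numRows, long rawDataSize, long fileSize) {
            this.numRows = numRows;
            this.rawDataSize = rawDataSize;
            this.fileSize = fileSize;
        }
    }

    /** Hypothetical container for the basic stats of one partition directory. */
    public static final class PartitionStats {
        public long numRows;
        public long rawDataSize;
        public long totalSize;
        public int numFiles;
    }

    /** Sums the footer stats of every file in a single partition directory. */
    public static PartitionStats aggregate(List<FileFooter> footers) {
        PartitionStats stats = new PartitionStats();
        for (FileFooter footer : footers) {
            stats.numRows += footer.numRows;
            stats.rawDataSize += footer.rawDataSize;
            stats.totalSize += footer.fileSize;
            stats.numFiles++;
        }
        return stats;
    }

    /** Iterates over partition directories and aggregates each one independently. */
    public static Map<String, PartitionStats> gather(Map<String, List<FileFooter>> partitions) {
        Map<String, PartitionStats> result = new LinkedHashMap<>();
        for (Map.Entry<String, List<FileFooter>> entry : partitions.entrySet()) {
            result.put(entry.getKey(), aggregate(entry.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up footer values for two files in one hypothetical partition.
        List<FileFooter> files = new ArrayList<>();
        files.add(new FileFooter(10, 100, 50));
        files.add(new FileFooter(5, 40, 20));

        Map<String, List<FileFooter>> partitions = new LinkedHashMap<>();
        partitions.put("ds=2014-03-11", files);

        for (Map.Entry<String, PartitionStats> e : gather(partitions).entrySet()) {
            PartitionStats s = e.getValue();
            System.out.println(e.getKey() + " rows=" + s.numRows
                + " rawDataSize=" + s.rawDataSize
                + " totalSize=" + s.totalSize
                + " files=" + s.numFiles);
        }
    }
}
```

The sketch assumes every file in a partition directory contributes its rows exactly once. That is the assumption the question is probing: if a directory can also hold files that record updates or deletes, a plain sum over footers may no longer match the visible row count.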

> Define API for RecordUpdater and UpdateReader
> ---------------------------------------------
>                 Key: HIVE-6060
>                 URL:
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Owen O'Malley
>            Assignee: Owen O'Malley
>         Attachments: HIVE-6060.patch, acid-io.patch, h-5317.patch, h-5317.patch, h-5317.patch, h-6060.patch, h-6060.patch
> We need to define some new APIs for how Hive interacts with the file formats since it
> needs to be much richer than the current RecordReader and RecordWriter.

This message was sent by Atlassian JIRA