hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yuntao Jia (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HIVE-640) Add LazyBinarySerDe to Hive
Date Thu, 30 Jul 2009 20:34:14 GMT

     [ https://issues.apache.org/jira/browse/HIVE-640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Yuntao Jia updated HIVE-640:

    Attachment: HIVE-640.2.patch

Second version patch, there are a few more changes. 
1/ Fixed two bugs in ObjectInspectorUtils.java at Line 465 and 474. At both places, the second
"loi1" should be "loi2". 
2/ Removed the "isNull" tag in all LazyBinaryPrimitive classes. Because currently, primitive
types are only initialized when they are not null, so the tag is not necessary. 
3/ Changed LazyBinaryStruct so that it supports metadata changes. In particular, serialization
data of an old table schema can be deserialized with a new table schema that has more fields
added at the end. For example, if the data is serialized with "Table (key int, field int)",
it can be deserialized with a new table schema "Table (key int, field int, value double)".
The new added last field will be defaulted to null.
4/ Changed the default value of null bits from 1 to 0 to support the above feature 3.
5/ Improved the unit test to cover two more cases:  tests LazyBinaryMap and test serialization
and deserialization with different schemas.
6/ Fixed a few comments

> Add LazyBinarySerDe to Hive
> ---------------------------
>                 Key: HIVE-640
>                 URL: https://issues.apache.org/jira/browse/HIVE-640
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: Zheng Shao
>            Assignee: Yuntao Jia
>         Attachments: HIVE-640.1.patch, HIVE-640.2.patch
> LazyBinarySerDe will serialize the data in binary format while supporting LazyDeserialization.
> This will be used as the SerDe for value between map and reduce, and also between different
map-reduce jobs.
> This will help improve the performance of Hive a lot.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message