hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Aaron Wiebe <epiph...@gmail.com>
Subject Re: Issue with job serialization formats mangling results
Date Fri, 23 Oct 2015 21:20:57 GMT
Right on - that solved it.  Thanks Gopal.

On Fri, Oct 23, 2015 at 3:31 PM, Gopal Vijayaraghavan <gopalv@apache.org> wrote:
>
>
>>I've then created ORC and Parquet versions of this same table.  The
>>behavior remains... select * works, any filter creates horribly
>>mangled results.
>>
>>To replace- throw this into a file:
>>
>>{"id":1,"order_id":8,"number":1,"broken":"#\n---\nstuff\nstuff2:
>>\"stuff3\"\nstuff4: '730'\nstuff5: []\n","last":null}
>
> You're trying to fix the issue on the wrong side of the problem, I think.
>
> Try with
>
> set
> hive.default.serde=org.apache.hadoop.hive.serde2.lazybinary.LazyBinarySerDe
> ;
> set hive.query.result.fileformat=SequenceFile;
>
>
> Hopefully we'll have a newer & more compact format for results soon.
>
> Cheers,
> Gopal
>
>

Mime
View raw message