hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From gaurav jain <jainy_gau...@yahoo.com>
Subject Re: How to output SeqFile
Date Wed, 06 Oct 2010 20:35:14 GMT
I do have that.

However I am not writing directly to the table partition. Instead, I first write 
my data in a tmp directory (eventually moved to the hdfs table partition)  and 
then publish that partition using alter table statement in metastore. 

Something like this:

-- create table x ... stored as SeqFile
-- insert overwrite directory 'd' select * from table y
-- distcp 'd'  x/dateint=.../hour=...
-- alter table x add partition ....

In the second step above I need to produce SeqFile.


Thanks for prompt reply.
Gaurav Jain


----- Original Message ----
From: Yang <teddyyyy123@gmail.com>
To: jainy_gaurav@yahoo.com
Sent: Wed, October 6, 2010 1:28:42 PM
Subject: Re: How to output SeqFile

Gaurav:

not sure if I understand your question correctly....
when you create the output table, that has an option to set the
output table SerDe

Regards
Yang

On Wed, Oct 6, 2010 at 1:18 PM, gaurav jain <jainy_gaurav@yahoo.com> wrote:
>
>
>
>
> How can I produce a sequence file from query
>
> insert overwrite directory ....
>
>
> I have set:
>
> SET io.seqfile.compression.type=BLOCK;
> SET hive.exec.compress.output=true;
> set mapred.output.compression.type=BLOCK;
> set mapred.output.compression.codec=org.apache.hadoop.io.compress.GzipCodec;
>
>
>
> It seems to produce Text .gz format files.
>
>
>
> Regards,
> Gaurav Jain
>
>
>
>



      

Mime
View raw message