hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ajay Chander <hadoopde...@gmail.com>
Subject Re: Hive_CSV
Date Wed, 09 Mar 2016 18:09:23 GMT
Daniel, thanks for your time. Is it like creating two tables, one is to get
all the data and the another one is to fetch the required data out of it?
If that is the case I was just concerned of redundant data. Please correct
me if I am wrong. Thanks

On Wednesday, March 9, 2016, Daniel Haviv <daniel.haviv@veracity-group.com>
wrote:

> Hi Ajay,
> Use the CSV serde to read your file, map all three columns but only select
> the relevant ones when you insert:
>
> Create table csvtab (
> irrelevant string,
> sportName string,
> sportType string) ...
>
> Insert into loaded_table select sportName, sportType from csvtab;
>
> Daniel
>
> > On 9 Mar 2016, at 19:43, Ajay Chander <hadoopdev18@gmail.com
> <javascript:;>> wrote:
> >
> > Hi Everyone,
> >
> > I am looking for a way, to ignore the first occurrence of the delimiter
> while loading the data from csv file to hive external table.
> >
> > Csv file:
> >
> > Xyz, baseball, outdoor
> >
> > Hive table has two columns sport_name & sport_type and fields are
> separated by ','
> >
> > Now I want to load by data into table such that while loading it has to
> ignore the first delimiter that ignore xyz and load the data from second
> delimiter.
> >
> > In the end my hive table should have the following data,
> >
> > Baseball, outdoor .
> >
> > Any inputs are appreciated. Thank you for your time.
>

Mime
View raw message